GLAUBER DYNAMICS FOR THE MEAN-FIELD POTTS MODEL 



P. CUFF, J. DING, O. LOUIDOR, E. LUBETZKY, Y. PERES, AND A. SLY 

Abstract. We study Glauber dynamics for the mean-field (Curie-Weiss) Potts model
with $q \geq 3$ states and show that it undergoes a critical slowdown at an inverse-temperature
$\beta_s(q)$ strictly lower than the critical $\beta_c(q)$ for uniqueness of the thermodynamic limit.
The dynamical critical $\beta_s(q)$ is the spinodal point marking the onset of metastability.

We prove that when $\beta < \beta_s(q)$ the mixing time is asymptotically $C(\beta,q)\, n \log n$ and
the dynamics exhibits the cutoff phenomenon, a sharp transition in mixing, with a window
of order $n$. At $\beta = \beta_s(q)$ the dynamics no longer exhibits cutoff and its mixing obeys
a power-law of order $n^{4/3}$. For $\beta > \beta_s(q)$ the mixing time is exponentially large in $n$.
Furthermore, as $\beta \uparrow \beta_s$ with $n$, the mixing time interpolates smoothly from subcritical
to critical behavior, with the latter reached at a scaling window of $O(n^{-2/3})$ around $\beta_s$.
These results form the first complete analysis of mixing around the critical dynamical
temperature, including the critical power law, for a model with a first order phase
transition.

1. Introduction and Results 

We study the dynamics of the Potts model on the complete graph (mean-field), known
as the Curie-Weiss Potts model. For $n \geq 1$, $\beta \geq 0$, the Curie-Weiss Potts distribution is
a probability measure on $\Sigma_n = Q^V$, where $Q = \{1, \ldots, q\}$ and $V = \{1, \ldots, n\}$, defined by

\[ \mu_n(\sigma) = Z_\beta^{-1} \exp\Big\{ \frac{\beta}{n} \sum_{u,v \in V} \mathbf{1}_{\{\sigma(u) = \sigma(v)\}} \Big\}, \]

where $\sigma \in \Sigma_n$ and $Z_\beta$ is the normalizing constant. When $q = 2$ this is the classic Ising
model, while in this paper we will focus on the case $q \geq 3$ for an integer $q$ (for an extension
to non-integer $q$ via the random cluster model, see e.g. [23]). We use the standard notation
$\beta_c(q)$ for the (explicitly known) threshold value between the ordered and the disordered
phases (see [10]).

Throughout the paper $(\sigma_t)_{t \geq 0}$ will denote the discrete time Glauber dynamics for this
model, namely, starting from $\sigma_0$, at each step we choose a vertex $u \in V$ uniformly and set

\[ \sigma_{t+1}(v) = \begin{cases} \sigma_t(v) & v \neq u, \\ k \ \text{with probability } \mu_n\big(\sigma(u) = k \mid \sigma(w) = \sigma_t(w)\ \forall w \neq u\big) & v = u. \end{cases} \]
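The update rule above can be simulated directly. The following is a minimal Python sketch, not the authors' code. It assumes that the sum in the exponent of $\mu_n$ runs over ordered pairs $(u,v)$, so that the conditional weight of colour $k$ at the chosen vertex is $\exp(2\beta N_k/n)$, with $N_k$ the number of other vertices of colour $k$. Colours are labelled $0, \ldots, q-1$ and the name `glauber_step` is illustrative.

```python
import math
import random

def glauber_step(sigma, q, beta, rng):
    """One heat-bath update of the Curie-Weiss Potts Glauber dynamics (in place)."""
    n = len(sigma)
    u = rng.randrange(n)                  # uniformly chosen vertex
    counts = [0] * q                      # colour counts over the other n - 1 vertices
    for w, colour in enumerate(sigma):
        if w != u:
            counts[colour] += 1
    # Assumed convention: ordered pairs, so each pair {u, w} is counted twice
    # and colour k gets weight exp(2 * beta * counts[k] / n).
    weights = [math.exp(2.0 * beta * c / n) for c in counts]
    sigma[u] = rng.choices(range(q), weights=weights)[0]
    return sigma

rng = random.Random(0)
sigma = [rng.randrange(3) for _ in range(30)]   # n = 30 vertices, q = 3 colours
for _ in range(500):
    glauber_step(sigma, 3, 0.5, rng)
```

Each step costs $O(n)$ here because the counts are recomputed; keeping a running count vector would make it $O(q)$.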

We denote by $P_n$ the transition kernel for this Markov process and by $\mathbb{P}_{\sigma_0}$ the underlying
probability measure. We will measure the distance between the distribution of the chain
at time $t$ and its stationary distribution via the total-variation norm. Accordingly,

\[ d_t^{\sigma_0}(n) = \big\| \mathbb{P}_{\sigma_0}(\sigma_t \in \cdot) - \mu_n \big\|_{\rm TV} \quad \text{and} \quad d_t(n) = \max_{\sigma_0 \in \Sigma_n} d_t^{\sigma_0}(n). \]

For $\varepsilon \in (0,1)$, the $\varepsilon$-mixing time is the number of steps until the total-variation distance
to stationarity is at most $\varepsilon$ in the worst case, i.e.,

\[ t_{{\rm MIX}(\varepsilon)}(n) = \inf\{ t : d_t(n) \leq \varepsilon \}, \]

and by convention we set $t_{\rm MIX}(n) := t_{{\rm MIX}(1/4)}(n)$. If for any fixed $\varepsilon \in (0,1)$

\[ w_\varepsilon(n) = t_{{\rm MIX}(\varepsilon)}(n) - t_{{\rm MIX}(1-\varepsilon)}(n) = o\big(t_{{\rm MIX}(1/4)}(n)\big) \quad \text{as } n \to \infty, \]

we say that the family of Markov chains exhibits the cutoff phenomenon, which describes
a sharp drop in the total variation distance from close to 1 to close to 0 (in an interval of time
of smaller order than $t_{\rm MIX}(n)$, denoted as the cutoff window). Observe that cutoff occurs if
and only if $t_{{\rm MIX}(\delta)}(n) / t_{{\rm MIX}(\varepsilon)}(n) \to 1$ as $n \to \infty$ for any fixed $\delta, \varepsilon \in (0,1)$.

Figure 1. Rapid mixing with cutoff in the subcritical regime of $\beta < \beta_s(q)$ for $q = 3$.
Blue curve marks the magnetization vector of the Glauber dynamics along time.
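For a chain small enough to store its transition matrix, the definitions of $d_t$ and of the $\varepsilon$-mixing time can be evaluated by brute force. The sketch below is only an illustration of these definitions on a toy two-state chain; it is not feasible for $\Sigma_n$ itself, whose size is $q^n$, and the function names are illustrative.

```python
def tv_distance(mu, nu):
    """Total-variation distance between two distributions on the same finite set."""
    return 0.5 * sum(abs(a - b) for a, b in zip(mu, nu))

def push_forward(dist, P):
    """One step of the chain applied to a distribution (row vector times matrix)."""
    m = len(P)
    return [sum(dist[x] * P[x][y] for x in range(m)) for y in range(m)]

def mixing_time(P, pi, eps=0.25, t_max=10_000):
    """Smallest t with max over starting states of TV distance to pi at most eps."""
    m = len(P)
    rows = [[1.0 if y == x else 0.0 for y in range(m)] for x in range(m)]
    for t in range(t_max + 1):
        if max(tv_distance(row, pi) for row in rows) <= eps:
            return t
        rows = [push_forward(row, P) for row in rows]
    raise ValueError("chain did not mix within t_max steps")

# Toy example: symmetric two-state chain with flip probability 0.1.
P = [[0.9, 0.1], [0.1, 0.9]]
pi = [0.5, 0.5]
t_mix = mixing_time(P, pi)   # worst-case distance is 0.5 * 0.8**t, so this is 4
```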

1.1. Results. We show that the dynamics for the Curie-Weiss Potts model undergoes a critical
slowdown at an inverse-temperature $\beta_s(q) > 0$. This dynamical threshold is given by

\[ \beta_s(q) = \sup\Big\{ \beta \geq 0 : \Big(1 + (q-1)\, e^{2\beta \frac{1 - qx}{q-1}}\Big)^{-1} - x \neq 0 \ \text{ for all } x \in (1/q, 1) \Big\}. \tag{1.1} \]

Unlike mean-field Ising, for which $\beta_s(2) = \beta_c(2) = 1$, the dynamical transition for $q \geq 3$
occurs at a strictly higher temperature than the static phase transition, i.e., $\beta_s(q) < \beta_c(q)$.
Our first result addresses the regime $\beta < \beta_s(q)$, where rapid mixing occurs within
$O(n \log n)$ steps and the dynamics exhibits cutoff with a window of size $O(n)$ (see Fig. 1).
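The threshold in (1.1) has no closed form for $q \geq 3$ but is easy to approximate numerically. The sketch below is a hedged illustration, not the authors' procedure. It assumes that the function inside (1.1) is $x \mapsto (1 + (q-1) e^{2\beta(1-qx)/(q-1)})^{-1} - x$, which is what the heat-bath rule gives when one colour has proportion $x$ and the remaining colours are balanced. It then bisects on $\beta$, using the fact that this function is increasing in $\beta$ for $x > 1/q$. The grid size and function names are arbitrary choices.

```python
import math

def drift(x, beta, q):
    """Assumed form of the function in (1.1), evaluated at proportion x."""
    return 1.0 / (1.0 + (q - 1) * math.exp(2.0 * beta * (1.0 - q * x) / (q - 1))) - x

def has_nonnegative_drift(beta, q, grid=4000):
    """True if drift >= 0 at some grid point strictly inside (1/q, 1)."""
    lo = 1.0 / q
    step = (1.0 - lo) / (grid + 1)
    return any(drift(lo + i * step, beta, q) >= 0.0 for i in range(1, grid + 1))

def spinodal_beta(q, iterations=50):
    """Bisection estimate of beta_s(q); relies on monotonicity of drift in beta."""
    low, high = 0.0, float(q)   # drift < 0 everywhere at beta = 0, > 0 near 1/q at beta = q
    for _ in range(iterations):
        mid = 0.5 * (low + high)
        if has_nonnegative_drift(mid, q):
            high = mid
        else:
            low = mid
    return 0.5 * (low + high)

beta_s_ising = spinodal_beta(2)   # should be close to 1, matching beta_s(2) = 1
beta_s_potts = spinodal_beta(3)
```

For $q = 2$ the estimate recovers $\beta_s(2) = 1$ up to the grid resolution, which serves as a sanity check on the assumed formula.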

Theorem 1. Let $q \geq 3$ be an integer. If $\beta < \beta_s(q)$ then the Glauber dynamics for the
$q$-state Curie-Weiss Potts model exhibits cutoff at mixing time

\[ t_{\rm MIX}(n) = \alpha_1(\beta,q)\, n \log n \tag{1.2} \]

with cutoff window $w_\varepsilon(n) = O_\varepsilon(n)$, where $\alpha_1(\beta,q) = [2(1 - 2\beta/q)]^{-1}$.
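As a quick worked instance of (1.2), the constant $\alpha_1$ and the leading-order mixing time can be evaluated as follows. This is a small illustrative sketch: the function names are hypothetical, and the code only checks $\beta < q/2$ (where $\alpha_1$ is positive and finite), not the stronger hypothesis $\beta < \beta_s(q)$ of Theorem 1.

```python
import math

def alpha1(beta, q):
    """alpha_1(beta, q) = [2 (1 - 2 beta / q)]^{-1} from Theorem 1."""
    if not 0.0 <= beta < q / 2.0:
        raise ValueError("alpha_1 is positive and finite only for 0 <= beta < q/2")
    return 1.0 / (2.0 * (1.0 - 2.0 * beta / q))

def leading_mixing_time(n, beta, q):
    """Leading-order term alpha_1(beta, q) * n * log(n) of the mixing time."""
    return alpha1(beta, q) * n * math.log(n)

a = alpha1(1.0, 3)                       # 1 / (2 * (1 - 2/3)) = 1.5
t = leading_mixing_time(1000, 1.0, 3)    # 1.5 * 1000 * log(1000)
```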

We proceed to analyze the order of the mixing time as $\beta(n) \to \beta_s(q)$ when $n \to \infty$, writing

\[ \beta(n) = \beta_s(q) - \xi(n), \tag{1.3} \]

where $\xi(n) \to 0$ as $n \to \infty$. The asymptotics of the mixing time will, of course, depend
on how fast $\xi$ decays, but it turns out that cutoff is observed if and only if the decay is slow
enough. This is captured in the following theorem.

Theorem 2. Let $q \geq 3$ be an integer. With $\beta(n)$ given as in equation (1.3) we have:

(1) If $\lim_{n \to \infty} n^{2/3} \xi(n) = \infty$ then the Glauber dynamics has cutoff with mixing time
and cutoff window given by

\[ t_{\rm MIX}(n) = \alpha_1(\beta(n),q)\, n \log n + \alpha_2(q)\, n / \sqrt{\xi(n)}, \]

\[ w_\varepsilon(n) = O_\varepsilon\big( n + \sqrt{n} / \xi(n)^{5/4} \big), \tag{1.4} \]

where $\alpha_2(q)$ is a positive constant and $\alpha_1$ is the constant defined in Theorem 1.

(2) If $0 \leq \liminf_{n \to \infty} n^{2/3} \xi(n) \leq \limsup_{n \to \infty} n^{2/3} \xi(n) < \infty$ then the dynamics does
not exhibit cutoff and has mixing time

\[ t_{{\rm MIX}(\varepsilon)}(n) = \Theta_\varepsilon(n^{4/3}). \tag{1.5} \]

Figure 2. Slow mixing without cutoff in the supercritical regime $\beta > \beta_s$ for $q = 3$. On
left $\beta_s < \beta < \beta_c$ and on right $\beta > \beta_c$. Curve color marks time from blue to red.

Part (2) of Theorem 2 in particular applies at criticality $\beta = \beta_s(q)$, where the mixing time
is of order $n^{4/3}$ with a scaling window of order $n^{-2/3}$ (in contrast, the mixing time for the
critical Ising model is of order $n^{3/2}$ with a window of $1/\sqrt{n}$).

Finally, above $\beta_s(q)$ the mixing time is exponentially large in $n$, as depicted in Fig. 2.

Theorem 3. Let $q \geq 3$ be an integer, and fix $\beta > \beta_s(q)$. For every $0 < \varepsilon < 1$ there exist
$C_1, C_2 > 0$ such that for all $n$,

\[ t_{{\rm MIX}(\varepsilon)}(n) \geq C_1 \exp(C_2 n). \]

Combined, these results give a complete analysis of the mixing time of Glauber dynamics
for the Curie-Weiss Potts model.

The slowdown in the mixing of the dynamics occurring as soon as $\beta \geq \beta_s(q)$ (be
it power-law at $\beta_s(q)$ or exponential mixing above this point) is due to the existence
of states from which the Markov chain takes a long time to escape. However, in the
range $\beta \in [\beta_s(q), \beta_c(q))$ the subset of initial configurations from which mixing is slow is
exponentially small in probability. One can then ask instead about the mixing time from
typical starting locations, known as essential mixing. Define the mixing time started from
a subset of initial configurations $\tilde{\Sigma}_n \subset \Sigma_n$ via $d_t^{\tilde{\Sigma}_n}(n) = \max_{\sigma_0 \in \tilde{\Sigma}_n} d_t^{\sigma_0}(n)$ as well as

\[ t^{\tilde{\Sigma}_n}_{{\rm MIX}(\varepsilon)}(n) = \inf\{ t : d_t^{\tilde{\Sigma}_n}(n) \leq \varepsilon \} \quad \text{and} \quad w^{\tilde{\Sigma}_n}_\varepsilon(n) = t^{\tilde{\Sigma}_n}_{{\rm MIX}(\varepsilon)}(n) - t^{\tilde{\Sigma}_n}_{{\rm MIX}(1-\varepsilon)}(n). \]

With these definitions we have the following result, showing that the subcritical mixing
time behavior from Theorem 1 extends all the way to $\beta < \beta_c(q)$ once one excludes a subset
of initial configurations with a total mass that is exponentially small in $n$.

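Theorem 4. Let $q \geq 3$ be an integer and let $\beta < \beta_c(q)$. There exist constants $C_1, C_2 > 0$
and subsets $\tilde{\Sigma}_n \subset \Sigma_n$ such that the Glauber dynamics has cutoff with mixing time and
cutoff window given by

\[ t^{\tilde{\Sigma}_n}_{\rm MIX}(n) = \alpha_1(\beta,q)\, n \log n \ ; \qquad w^{\tilde{\Sigma}_n}_\varepsilon(n) = O_\varepsilon(n), \]

where $\mu_n(\Sigma_n \setminus \tilde{\Sigma}_n) \leq C_1 e^{-C_2 n}$ and $\alpha_1$ is the constant in Theorem 1.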

1.2. Related work. Through several decades of work by mathematicians, physicists and
computer scientists a general picture of how the mixing time varies with the temperature
has been developed. It is believed that in a wide class of models and geometries the mixing
time undergoes the following "critical slowdown". For some critical inverse-temperature
$\beta_d$ and a geometric parameter $L(n)$, where $n$ is the size of the system, we should have:

• High temperature ($0 \leq \beta < \beta_d$): mixing time of order $n \log n$ with cutoff.

• Critical temperature ($\beta = \beta_d$): mixing time of order $n L(n)^z$ for some fixed $z > 0$.

• Low temperature ($\beta > \beta_d$): mixing time of order $\exp(\tau_\beta L(n))$ for some fixed $\tau_\beta > 0$.

Figure 3. Minimal drift towards $1/q$ of a single coordinate of $S_t$ as a function of its
value for the Curie-Weiss Potts model with $q = 3$ and different values of $\beta$. Two lowest
curves correspond to $\beta < \beta_s$, green middle curve has $\beta = \beta_s$ (with points marking its
two local extrema), second from top curve has $\beta_s < \beta < \beta_c$ and top curve has $\beta = \beta_c$.

For a more comprehensive description of critical slowdown see [19,30,33]. It should be 
noted that to demonstrate the above phenomenon in full, one needs to derive precise esti- 
mates on the mixing time up to the critical temperature, which can be quite challenging. 

Perhaps the most studied model in this context is Ising. For the complete graph, a 
comprehensive treatment is given in [17,18,28], where critical slowdown (as described 
above) around the uniqueness threshold $\beta_c$ is established in full. In this setting, finer
statements about the asymptotics of the mixing time can be made. For instance, in [17]
the case where $\beta$ approaches $\beta_c$ with the size of the system is analyzed (in Theorem 2 here
we consider this case as well). The same picture, yet with the notable exclusion of a cutoff
proof at high temperatures, is also known on the d-regular tree where [•>] established the 
high and low temperature regimes and recently [I'l] proved polynomial mixing at criticality. 

From a mathematical physics point of view, the most interesting underlying graph to
consider is the lattice $\mathbb{Z}^d$. For $d = 2$ the full critical slowdown is now known: for a box with
$n$ vertices the mixing time is $O(n \log n)$ throughout the high temperature regime [34,35]
whereas it is $\exp((\tau_\beta + o(1))\sqrt{n})$ throughout the low temperature regime [13,14,39] with $\tau_\beta$
being the surface tension. The $d = 2$ picture was very recently completed by two of the
authors establishing cutoff in the high temperature regime [32] and polynomial mixing at
the critical temperature [30]. For a more comprehensive survey of recent literature for
Ising on the lattice see [30].

Turning back to the Potts model, understanding the kinetic picture here is of interest,
not just as an extension of the results for Ising, but as an example of a model with a
first order phase transition. Unlike in Ising, the free energy in the Potts model on various
graphs and values of $q$ undergoes a first order phase transition as the temperature is varied.
This is certainly true for all $q \geq 3$ in the mean-field approximation, i.e. on the complete
graph as treated here, but also known to be the case on $\mathbb{Z}^d$ for $d \geq 2$ and $q > Q(d)$ for
some $Q(d) < \infty$ [23] (although most values of $Q(d)$ are not known rigorously, it was shown
that $Q(2) = 4$ [ ] and $Q(d) < 3$ for all $d$ large enough [G]).

A first order phase transition has direct implications on the dynamics of the system.
For one, the coexistence of phases at criticality implies slow mixing. This is because
getting from one phase to another requires passing through a large free energy barrier,
i.e. states which are exponentially unlikely. Indeed, in [10, 11] the mixing time for Potts
on a box with $n$ vertices in $\mathbb{Z}^d$ for any fixed $d \geq 2$ and sufficiently large $q$ is shown
to be exponential in the surface area of the box for any $\beta$ larger than or (notably) equal to
the uniqueness threshold $\beta_c(d,q)$. This should be compared with the aforementioned
polynomial mixing of Glauber dynamics for Ising at criticality. In fact, coexistence of the
ordered and disordered phases also accounts for the slow mixing of the Swendsen-Wang
dynamics at the critical temperature. This is shown in [10, 11] for $\mathbb{Z}^d$ under a similar range
of d and q and in [ ■ -] for the complete graph. Other dynamics also exhibit slow mixing 
at criticality [I]. 

First order phase transitions are expected to lead to metastability type phenomena on 
the lattice in some instances. There has been extensive work on this topic (see [5, 12] 
and the references therein) yet the picture remains incomplete. It is expected that the 
transition to equilibrium will be carried through a nucleation process, which has an $O(1)$
lifetime and therefore does not affect the order of the mixing time, in contrast to the
mean-field case. This is affirmed, for instance, in Ising where $O(n \log n)$ mixing time is known for
low enough temperatures under an (arbitrarily small) non-zero external field, despite the
first order phase transition (in the field) around 0. For related works see e.g. [7,8,15,36,37]
as well as [■/>] and the references there. 

Similarly, the Potts model on the lattice should feature rapid mixing of $O(n \log n)$
throughout the sub-critical regime $\beta < \beta_c(d,q)$ due to the vanishing surface-area-to-volume
ratio. Thus, contrary to the critical slowdown picture predicted for Ising, whenever there
is a first order phase transition it should be accompanied by a sharp transition from fast
mixing at $\beta < \beta_c$ to an exponentially slow mixing at $\beta_c$ in lieu of a critical power law.
However, fast mixing of the Potts model on $\mathbb{Z}^d$ for $\beta < \beta_c$ is not rigorously known except
at very high temperatures (where it follows from standard arguments [•)■>]). For sufficiently 
high temperatures, cutoff was very recently shown in [ ]. 

On the complete graph however, metastability is apparent. In the absence of geometry, 
the order parameter sufficiently characterizes the state of the system and thus the dynamics 
and its stationary distribution are described effectively by the free energy of the system as 
a function of the order parameter. While coexistence of phases implies that at criticality 
the free energy is minimized at more than one value of the order parameter (corresponding 
to each phase), continuity entails that some of these global minima will turn into local 
minima just below or above criticality. These local minimizers correspond exactly to 
the metastable states and the value (or curve) of the thermodynamic parameter (e.g. 
temperature) beyond which these local minima cease to appear is called spinodal. 

Consider the model at hand, namely Potts on the complete graph (for general
background on the Potts model, see e.g. [9,21,23]). The order parameter here, analogous
to the magnetization in the Ising model, is the vector of proportions of each color
$s \in \mathcal{S} = \{x \in \mathbb{R}^q_+ : \|x\|_1 = 1\}$. It is well known that there exists $\beta_c(q)$, below which the
Potts distribution $\mu_n$ is supported almost entirely on configurations with roughly equal
(about $1/q$) proportions of each color and above which $\mu_n$ is supported almost entirely on
configurations where one of the $q$ colors is dominant. In the former case we say that in
equilibrium the system is in the disordered phase, while in the latter case we say that in
equilibrium the system is in one of the $q$ ordered phases, corresponding to the $q$ colors.

Up to relabeling of the vertices, configurations are essentially described by the
proportions vector $s$ and as such, on a logarithmic scale, the Potts distribution can be read from
the graph of the free energy as a function of $s$. As depicted in Figure 4 (showing $q = 3$, the
situation is qualitatively the same for all $q > 2$), when $\beta < \beta_c$ the free energy has a single
global minimum at the center, corresponding to the disordered phase, while for $\beta > \beta_c$
there are $q$ "on-axis" global minima, corresponding to the $q$ ordered phases obtainable
from one another through a permutation of the coordinates. At $\beta_c$, coexistence of the
ordered and disordered phases is evident in the presence of $q + 1$ global minima of the free
energy. For more details see, e.g., [16].

Figure 4. The free energy as a function of the proportions vector $s$ for the Potts model
on the complete graph with $q = 3$ and large $n$. The simplex $\mathcal{S} = \{s \in \mathbb{R}^3_+ : \|s\|_1 = 1\}$ is
mapped into the XY plane via $(s^1, s^2, s^3) \mapsto (s^1, s^2, 1 - s^1 - s^2)$.

Below $\beta_c$ but sufficiently close to it, the free energy, globally minimized only at the
center, has $q$ local minima in place of the $q$ global minima which corresponded to the ordered
phases at criticality. Once $\beta$ is too small, these local minima disappear. The threshold
value for the appearance of these local minima is the spinodal inverse temperature $\beta_s$
(there is a similar behavior above (3c marked by a second spinodal temperature (3s, as 
illustrated by Figure i, but we do not address this regime in the paper). 



Figure 5. The free energy as a function of the magnetization $m$ for the Ising model on
the complete graph for large $n$, shown for $\beta < \beta_c(2)$, $\beta = \beta_c(2)$ and $\beta > \beta_c(2)$ (once over
the full range of $m$ and once zoomed in around $m = 0.5$). No phase coexistence at $\beta_c$ and the
global maximizers for $\beta > \beta_c$ are seen to emerge continuously from $m = 0.5$.

Once the system starts from an initial configuration whose proportions vector is close 
to a local minimizer, the system will spend a time which is exponential in n near this 
minimizer before escaping to the global minimizer and reaching equilibrium. This is 
because away from a local minimizer, energy increases locally exponentially (in $n$), i.e.
there is an energy barrier of an exponential order to cross. Thus, as $n \to \infty$ the system will
spend an unbounded amount of time at a non-equilibrium state, which will be seemingly 
stable. In terms of the mixing time of the dynamics, as the definition involves the worst 
case initial configuration, metastable states will result in exponentially slow mixing. 

Our result is a rigorous affirmation of this picture. Although the definition of $\beta_s$ in (1.1)
seems different from the one given above for the spinodal inverse temperature, it can be
shown, in fact, that this is indeed the threshold value of $\beta$ for the emergence of local
minima below $\beta_c$. Theorem 3 then asserts that above $\beta_s$ mixing is exponentially slow
while Theorem 1 shows that below $\beta_s$ mixing is still fast. The set of configurations whose
exclusion in Theorem 4 leads to fast mixing all the way up to (but below) $\beta_c$ are precisely
the ones from which the process will get stuck in a metastable state. Indeed as the free 
energy of such initial configurations is higher than that of configurations near the globally 
minimizing stable state, such configurations will have a probability which is exponentially 
small in the size of the system. 

Furthermore, the transition from fast to slow mixing passes through polynomial mixing
which occurs at $\beta_s$ and in its vicinity (Theorem 2). This in fact establishes that the
aforementioned critical slowdown phenomenon occurs here as well, albeit at the spinodal
rather than the uniqueness threshold. We predict that this should be the case for the
dynamical behavior on other mean-field geometries such as an Erdős-Rényi random graph
or a random regular graph.

For a (non-rigorous) treatment of metastability and its effect on the rate of convergence 
to equilibrium in other mean-field models with a first order phase transition, see for ex- 
ample [24,25]. A rigorous analysis of such a system (the Blume-Capel model), below and 
above criticality, was recently carried out in [26]. It is illuminating to contrast the graph 
of the free energy as a function of the proportions vector in the Potts model to that of the 
free energy as a function of the magnetization in Curie-Weiss Ising, given in Figure 5. The
second order phase transition and lack of phase coexistence at the critical temperature
imply the absence of local minima at any value below or above $\beta_c$. As a consequence
there is no spinodal temperature and mixing is fast throughout the whole range $\beta < \beta_c$.

1.3. Proof Ideas. As discussed before, up to a permutation of the vertices, configurations
can be described by their proportions vector. Formally, for a configuration $\sigma \in \Sigma_n$, we
denote by $S(\sigma)$ the $q$-dimensional vector $(S^1(\sigma), \ldots, S^q(\sigma))$, where

\[ S^k(\sigma) = \frac{1}{n} \sum_{v \in V} \mathbf{1}_{\{\sigma(v) = k\}}. \]

Note that $S(\sigma) \in \mathcal{S}$ where $\mathcal{S} = \{x \in \mathbb{R}^q_+ : \|x\|_1 = 1\}$. Now it is not difficult to see that
$S_t = S(\sigma_t)$ is itself a Markov process with state space $\mathcal{S}$ and stationary distribution
$\pi_n = \mu_n \circ S^{-1}$, the distribution of $S(\sigma)$ under $\mu_n$. We shall refer to this process as the proportions
chain. Figures 1 and 2 show a realization of the proportions chain superimposed on
the free energy graph plotted upside down for better visibility. Three different values of $\beta$,
corresponding to three different regimes, are exhibited. The color of the curve, representing
time, shows the temporal evolution of the proportions chain. Notice how local minima
(shown as local maxima) "trap" the chain for a long time.

As a projection of the chain, $S_t$ mixes at least as fast as $\sigma_t$. Moreover, when starting
in one of the $q$ configurations where all sites have the same color, a symmetry argument
reveals that the mixing time of $S_t$ is equal to that of $\sigma_t$. One therefore must control the
effect of starting from an initial state which is not monochrome. Using a coupling argument
we show that the difference in the mixing times is of the same order as the cutoff window
for $S_t$ and can thus be absorbed into our error estimates. It will then suffice to analyze the
proportions chain, which is of a lesser complexity than the original process. In particular
the state space of $S_t$ has a fixed dimension $q - 1$, independently of $n$, and its transition
probabilities can be easily calculated.
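To make the last remark concrete, the proportions chain can be simulated on colour counts alone, without storing a configuration. The sketch below is an illustration under the same assumption as before (ordered-pair sum in $\mu_n$, hence heat-bath weights $\exp(2\beta N_k/n)$ with $N_k$ the count of colour $k$ among the other $n-1$ vertices). The function name is hypothetical.

```python
import math
import random

def proportions_step(counts, beta, rng):
    """One step of the proportions chain, acting in place on the colour counts n * S_t."""
    n = sum(counts)
    q = len(counts)
    # A uniformly chosen vertex has colour j with probability counts[j] / n.
    j = rng.choices(range(q), weights=counts)[0]
    counts[j] -= 1                                   # counts now describe the other n - 1 vertices
    weights = [math.exp(2.0 * beta * c / n) for c in counts]
    k = rng.choices(range(q), weights=weights)[0]    # heat-bath choice of the new colour
    counts[k] += 1
    return counts

rng = random.Random(0)
counts = [60, 0, 0]              # monochrome start, n = 60, q = 3
for _ in range(5000):
    proportions_step(counts, 0.5, rng)
proportions = [c / 60 for c in counts]
```

Starting from a monochrome state at a small value of $\beta$, the counts should settle near the balanced vector $(n/q, \ldots, n/q)$, in line with the discussion that follows.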

Next, we show that when $\beta < \beta_c(q)$ most of the mass of the stationary distribution $\pi_n$
is concentrated on balanced states whose distance from the "equi-proportionality" vector
$(1/q, \ldots, 1/q)$ is $O(1/\sqrt{n})$. A simple coupling argument then shows that the Markov chain
is mixed soon after arriving at such a state. Thus the main effort in the proof becomes
finding sharp estimates on the time it takes for $S_t$ to reach a balanced state from a
worst-case initial configuration, in different regimes of $\beta$.

It turns out that these hitting times are determined by the function $D_\beta(x)$, which is
defined as (up to a multiplication by $1/n$) the drift of one coordinate of $S_t$ when that
coordinate has value $x$ in the worst case, i.e. the minimum drift towards $1/q$, where the
minimum is taken over all possible values for the remaining coordinates. An explicit
formula for $D_\beta(x)$ can be obtained (3.2). Its graph is plotted in Figure 3 for $x \in [1/q, 1]$
and different values of $\beta$. For $\beta \ll \beta_c$, this drift is strictly negative in $(1/q, 1]$ and thus
each coordinate quickly (in $O(n \log n)$ time) gets to within $1/\sqrt{n}$ of $1/q$. On the other
hand, the function $D_\beta(x)$ is monotone increasing in $\beta$ and therefore for sufficiently large
$\beta$, it will no longer be negative throughout $(1/q, 1]$: there will be an interval in $(1/q, 1]$
where it is positive. Such an interval will take an exponential amount of time to traverse
and this will lead to an exponential mixing time. The smallest $\beta$ for which this happens
is, by definition, $\beta_s$. This $\beta_s$ in turn coincides with the inverse temperature at which local
minima begin to appear in the free energy as a function of $s$. In fact, to show exponentially
slow mixing, we use standard conductance arguments, which in face of local minima in
the free energy give exponential mixing quite automatically.

The most delicate analysis is in the critical regime where $\beta$ is near or equal to $\beta_s$. In
this case the $x$-axis is tangential to the graph of $D_\beta(x)$ at its peak (the green curve in
Figure 3) and the challenge is in finding the asymptotics of the passage time through the
tangential point on the $x$-axis (left green dot in the figure). As the drift there is 0, locally
around this point, a coordinate of $S_t$ behaves as a random walk and Doob's decomposition
of a suitably chosen function of the coordinate yields the right passage time estimates.

1.4. Organization. Section 2 sets notation and contains some useful facts on the
Curie-Weiss Potts model, as well as tools (and a few non-standard variations on them) needed
in the analysis of mixing time. In Section 3 we derive basic properties of the proportions
chain that will be repeatedly used in the remainder of the proof. In Section 4 we analyze
the case $\beta < \beta_s(q)$ and prove Theorem 1 while Section 5 treats the case $\beta > \beta_s(q)$ and
establishes Theorem 3. The near critical regime is analyzed in Section 6, which includes
the proof of Theorem 2. The final section, Section 7, gives the proof of Theorem 4.

2. Preliminaries 

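2.1. Notation. We let $[a, b]$ denote the set $\{a, \ldots, b\}$ for $a, b \in \mathbb{Z}$. We use the same
notation for vector and scalar valued variables. For an $m$-dimensional vector $s$, we denote
by $s^k$ its $k$-th component and for $I = \{i_1, \ldots, i_r\} \subset [1,m]$, $s^I = (s^{i_1}, \ldots, s^{i_r})$.
Matrix-valued variables will appear in bold. We let $\mathbf{W}^{m,k}$ denote the $(m,k)$ element of $\mathbf{W}$ and
let $\mathbf{W}^m$ denote its $m$-th row.

We write $e_i$ for the unit vector in the $i$-th direction and $e = (1/q, 1/q, \ldots, 1/q) \in \mathbb{R}^q$ for
the equiproportionality vector. For $s \in \mathbb{R}^q$, we denote $\hat{s} = s - e$.

Most of our vectors will live on the simplex $\mathcal{S} = \{x \in \mathbb{R}^q_+ : \|x\|_1 = 1\}$ or even
$\mathcal{S}_n = \mathcal{S} \cap \frac{1}{n}\mathbb{Z}^q$. Occasionally we would like to further limit this set and for $\rho > 0$ we define

\[ \mathcal{S}^\rho = \{s \in \mathcal{S} : \|\hat{s}\|_\infty < \rho\}, \quad \mathcal{S}^\rho_n = \mathcal{S}^\rho \cap \tfrac{1}{n}\mathbb{Z}^q, \quad \Sigma^\rho_n = S^{-1}(\mathcal{S}^\rho_n), \]

\[ \mathcal{S}^{\rho+} = \{s \in \mathcal{S} : s^k < 1/q + \rho \ \ \forall k \in [1,q]\}, \quad \mathcal{S}^{\rho+}_n = \mathcal{S}^{\rho+} \cap \tfrac{1}{n}\mathbb{Z}^q, \quad \Sigma^{\rho+}_n = S^{-1}(\mathcal{S}^{\rho+}_n). \]

Note that $\mathcal{S}^{\rho+} \subset \mathcal{S}^{(q-1)\rho}$ and similar relations hold for $\mathcal{S}_n$ and $\Sigma_n$.

Vectors in $\mathcal{S}$ will often be viewed also as distributions on $[1,q]$. A coupling of $\nu, \tilde{\nu} \in \mathcal{S}$
is the joint distribution of two random variables $X, \tilde{X}$, defined on the same probability
space and marginally distributed according to $\nu, \tilde{\nu}$. If $\mathbb{P}^*$ is the underlying probability
measure then we always have $\|\nu - \tilde{\nu}\|_{\rm TV} \leq \mathbb{P}^*(X \neq \tilde{X})$. We shall call this coupling a best
coupling if it satisfies $\|\nu - \tilde{\nu}\|_{\rm TV} = \mathbb{P}^*(X \neq \tilde{X})$. Such a coupling always exists.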

In the course of the proofs, we introduce various couplings of two copies of the Glauber
dynamics $(\sigma_t)_t$. For the second copy we shall use the notation $\tilde{\sigma}_t$ and $\tilde{S}_t = S(\tilde{\sigma}_t)$. Couplings
will be identified by their underlying probability measure, for which we will use the symbol 
P with a superscript that changes from coupling to coupling, e.g. P^*-^. A subscript will 
indicate initial state or states, e.g. F^^~^. The expectation and variance, E[-] and Var(-) 
resp., will be decorated in the same way as the underlying measure with respect to which
they are defined. The $\sigma$-algebra $\mathcal{F}_t$ will always include all the randomness up to time $t$.
For example, with a single chain $(\sigma_t)_t$ this is the $\sigma$-algebra generated by $\{\sigma_s : s \leq t\}$; for
a coupling of two chains $(\sigma_t)_t, (\tilde{\sigma}_t)_t$, it is the one generated by $\{\sigma_s, \tilde{\sigma}_s : s \leq t\}$, etc.

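2.2. Large Deviations Results for the Curie-Weiss Potts Distribution. In this
subsection we recall several results concerning the concentration of the proportions vector
measures $\pi_n$. See, e.g., [10,20] for proofs of these results.

It is a consequence of Sanov's Theorem together with an application of Varadhan's
Lemma that the sequence $(\pi_n)_{n \geq 1}$ satisfies a large deviation principle (LDP) on $\mathcal{S}$ with
rate function

\[ I_{\beta,q}(s) = \sum_{k=1}^{q} \Big[ s^k \log(q s^k) - \beta (s^k)^2 \Big] + C, \tag{2.1} \]

where $C$ is chosen so that $\min_{s \in \mathcal{S}} I_{\beta,q}(s) = 0$. The minimizing set $\mathcal{E}_{\beta,q} = \{s : I_{\beta,q}(s) = 0\}$,
which is the support of the weak limit $\rho_{\beta,q}$ of $(\pi_n)_{n \geq 1}$, is then

\[ \mathcal{E}_{\beta,q} = \begin{cases} \{e\} & \text{if } \beta < \beta_c(q), \\ \{e, T^1 s_{\beta_c,q}, T^2 s_{\beta_c,q}, \ldots, T^q s_{\beta_c,q}\} & \text{if } \beta = \beta_c(q), \\ \{T^1 s_{\beta,q}, T^2 s_{\beta,q}, \ldots, T^q s_{\beta,q}\} & \text{if } \beta > \beta_c(q), \end{cases} \tag{2.2} \]

where

\[ \beta_c(q) = \frac{q-1}{q-2} \log(q-1) \tag{2.3} \]

and

\[ s_{\beta,q} = \Big( s^1_{\beta,q}, \frac{1 - s^1_{\beta,q}}{q-1}, \ldots, \frac{1 - s^1_{\beta,q}}{q-1} \Big). \tag{2.4} \]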



The function $\beta \mapsto s^1_{\beta,q}$ is continuous and increasing on $[\beta_c(q), \infty)$ and $T^k : \mathcal{S} \to \mathcal{S}$
interchanges the 1-st and $k$-th coordinates. Furthermore, the value of $s_{\beta,q}$ for all $\beta, q$ is
known in implicit form and in particular for $\beta = \beta_c(q)$, we have

\[ s_{\beta_c(q),q} = \Big( 1 - \frac{1}{q}, \frac{1}{q(q-1)}, \ldots, \frac{1}{q(q-1)} \Big). \tag{2.5} \]

This is true for all $q \geq 3$. For $q = 2$, (2.1), (2.2), (2.4), (2.5) still hold, but the critical
inverse-temperature is now

\[ \beta_c(2) = 1. \tag{2.6} \]

It is here that a fundamental difference between $q = 2$ and $q > 2$ can be observed. If
$q = 2$ then $s_{\beta_c(2),2} = e$, in which case $\beta \mapsto \rho_{\beta,2}$ is continuous for all $\beta \geq 0$. On the other
hand, if $q \geq 3$ we have $s_{\beta_c(q),q} \neq e$ and $\beta \mapsto \rho_{\beta,q}$ is discontinuous at $\beta_c(q)$. Thus, as it
is recorded in the Physics literature, the system exhibits a first order phase transition if
$q \geq 3$, but only a second order phase transition if $q = 2$.
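The coexistence of the ordered and disordered phases at $\beta_c(q)$ can be checked numerically. The sketch below rests on three assumptions, stated in the normalization where $\beta_c(2) = 1$: the rate function is $\sum_k s^k \log(q s^k) - \beta \sum_k (s^k)^2$ up to the additive constant $C$, the threshold is $\beta_c(q) = \frac{q-1}{q-2}\log(q-1)$, and the ordered minimizer at $\beta_c$ is $(1 - 1/q, \frac{1}{q(q-1)}, \ldots, \frac{1}{q(q-1)})$. It verifies that the rate function takes the same value at $e$ and at that ordered point. Function names are illustrative.

```python
import math

def rate(s, beta):
    """Assumed rate function without the constant C."""
    q = len(s)
    entropy = sum(x * math.log(q * x) for x in s if x > 0.0)
    return entropy - beta * sum(x * x for x in s)

def beta_c(q):
    """Assumed static threshold for q >= 3."""
    return (q - 1) / (q - 2) * math.log(q - 1)

def ordered_point_at_beta_c(q):
    """Assumed dominant-colour minimizer at beta_c(q)."""
    return [1.0 - 1.0 / q] + [1.0 / (q * (q - 1))] * (q - 1)

gaps = {}
for q in (3, 4, 10):
    e = [1.0 / q] * q
    # Difference of rate-function values; it should vanish up to rounding error.
    gaps[q] = rate(ordered_point_at_beta_c(q), beta_c(q)) - rate(e, beta_c(q))
```

For $q = 2$ the ordered point coincides with $e$, so no second minimizer appears, consistent with the second order transition described above.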

2.3. Hitting Time Estimates for General Supermartingales. We will require some 
standard hitting time estimates for supermartingales and related processes. 

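Lemma 2.1. For $x_0 \in \mathbb{R}$, let $(X_t)_{t \geq 0}$ be a discrete time process, adapted to $(\mathcal{F}_t)_{t \geq 0}$
which satisfies

(1) $\exists \delta \geq 0$ : $\mathbb{E}_{x_0}[X_{t+1} - X_t \mid \mathcal{F}_t] \leq -\delta$ on $\{X_t \geq 0\}$ for all $t \geq 0$.

(2) $\exists R > 0$ : $|X_{t+1} - X_t| \leq R$, $\forall t \geq 0$.

(3) $X_0 = x_0$.

where $\mathbb{P}_{x_0}$ is the underlying probability measure. Let $\tau_x^- = \inf\{t : X_t \leq x\}$ and
$\tau_x^+ = \inf\{t : X_t \geq x\}$. The following holds.

(1) If $\delta > 0$ then for any $t_1 \geq 0$:

\[ \mathbb{P}_{x_0}(\tau_0^- > t_1) \leq \exp\Big\{ -\frac{(\delta t_1 - x_0)^2}{8 t_1 R^2} \Big\}. \tag{2.7} \]

(2) If $x_0 \leq 0$, then for any $x_1 > 0$ and $t_2 \geq 0$,

\[ \mathbb{P}_{x_0}(\tau_{x_1}^+ \leq t_2) \leq 2 \exp\Big\{ -\frac{(x_1 - R)^2}{8 t_2 R^2} \Big\}. \tag{2.8} \]

(3) If $x_0 \leq 0$, $\delta > 0$, then for any $x_1 > 0$ and $t_3 \geq 0$:

\[ \mathbb{P}_{x_0}(\tau_{x_1}^+ \leq t_3) \leq t_3^2 \exp\Big\{ -\frac{(x_1 - R)\delta^2}{8 R^3} \Big\}. \tag{2.9} \]

Proof. Starting with (1), if $x_0 < 0$, there is nothing to prove. Otherwise, let $Z_t$ be a
supermartingale independent of $X_t$, which starts from 0, has drift $-\delta$ and steps which
are bounded by $R$. Set $Y_t = X_{t \wedge \tau_0^-} + Z_{(t - \tau_0^-)^+}$ and write $Y_t = M_t + A_t$ as its Doob
decomposition, with $M_t$ a martingale, $A_t$ a predictable process and $A_0 = x_0$. Clearly
$A_t \leq x_0 - \delta t$ and $|M_{t+1} - M_t| \leq 2R$, $\mathbb{P}_{x_0}$-a.s. The Hoeffding-Azuma inequality implies:

\[ \mathbb{P}_{x_0}(\tau_0^- > t_1) \leq \mathbb{P}_{x_0}(Y_{t_1} > 0) \leq \mathbb{P}_{x_0}(M_{t_1} > \delta t_1 - x_0) \leq \exp\Big\{ -\frac{(\delta t_1 - x_0)^2}{8 t_1 R^2} \Big\}, \]

as desired.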

Now set $Y_t = X_{t \wedge \tau_{x_1}^+}$ and observe that $(Y_t)_{t \geq 0}$ satisfies conditions 1-3, and consequently
it is enough to prove (2.8) with $\mathbb{P}_{x_0}(Y_{t_2} \geq x_1)$ as the LHS. Therefore, for all $t$, let
$W_{t+1} = \mathbb{E}_{x_0}[Y_{t+1} - Y_t \mid \mathcal{F}_t]$ and $Z_{t+1} = Y_{t+1} - Y_t - W_{t+1}$. Then clearly, $W_{t+1}, Z_t$ are $\mathcal{F}_t$-measurable,
$W_{t+1} \leq 0$ on $\{Y_t \geq 0\}$ and $\mathbb{E}_{x_0}[Z_{t+1} \mid \mathcal{F}_t] = 0$. This is a Doob-type decomposition. Now,
define $M_t$ for all $t$ inductively as follows:

\[ M_0 = 0 \ ; \qquad M_{t+1} = M_t + {\rm sign}(M_t) Z_{t+1}, \]

and set $N_t = |M_t|$. We claim the following:

(1) $(M_t)_{t \geq 0}$ is an $(\mathcal{F}_t)_{t \geq 0}$-adapted martingale.

(2) $N_0 = 0$; $N_{t+1} = |N_t + Z_{t+1}|$, $\forall t$.

(3) $Y_t \leq N_t + R$.

The first two assertions follow straightforwardly from the construction. The third one
can be proven by induction, since it clearly holds for $t = 0$ and, assuming $Y_{t-1} \leq N_{t-1} + R$,
if $Y_{t-1} \geq 0$, then

\[ Y_t = Y_{t-1} + (Y_t - Y_{t-1}) \leq N_{t-1} + R + Z_t \leq N_t + R, \]

and if $Y_{t-1} < 0$, then

\[ Y_t \leq R \leq N_t + R. \]

Finally, by the Hoeffding-Azuma inequality applied to $M_t$ we have

\[ \mathbb{P}_{x_0}(Y_{t_2} \geq x_1) \leq \mathbb{P}_{x_0}(|M_{t_2}| \geq x_1 - R) \leq 2 \exp\Big\{ -\frac{(x_1 - R)^2}{8 t_2 R^2} \Big\}. \]

This shows (2.8).

For part (3), let $\tau_0^-(s) = \inf\{t \geq s : X_t \leq 0\}$ and $Y_t(s)$, $M_t(s)$ and $A_t(s)$ be defined as in
the proof of part (1), only with $\tau_0^-(s)$ replacing $\tau_0^-$. Then,

\[ \mathbb{P}_{x_0}(\tau_{x_1}^+ \leq t_3) \leq \sum_{s_1 < s_2 \leq t_3} \mathbb{P}_{x_0}\big(X_{s_1} \leq 0,\ X_{s_2} \geq x_1,\ X_t > 0 \ \ \forall s_1 < t \leq s_2\big) \]

\[ \leq \sum_{s_1 < s_2 \leq t_3} \mathbb{P}_{x_0}\big(M_{s_2}(s_1) - M_{s_1}(s_1) \geq \delta(s_2 - s_1) + (x_1 - R)^+\big) \]

\[ \leq t_3^2 \exp\Big\{ -\frac{(x_1 - R)\delta^2}{8 R^3} \Big\}, \]

where the last inequality follows from the Hoeffding-Azuma inequality and since the summands
are non-zero only when $0 \leq s_1 < s_2 \leq t_3$ and $(s_2 - s_1) R \geq x_1 - R$. □
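As a sanity check of part (1), one can compare the bound with a simulated biased nearest-neighbour walk, for which $R = 1$ and $\delta = 2p - 1$ when the down-step probability is $p$. The sketch below is illustrative only. It assumes that the right-hand side of (2.7) reads $\exp\{-(\delta t_1 - x_0)^2 / (8 t_1 R^2)\}$, which is what the Hoeffding-Azuma inequality gives for martingale increments bounded by $2R$. The function names are hypothetical.

```python
import math
import random

def azuma_bound(x0, delta, R, t1):
    """Assumed right-hand side of (2.7)."""
    return math.exp(-((delta * t1 - x0) ** 2) / (8.0 * t1 * R * R))

def survival_frequency(x0, p_down, t1, trials, rng):
    """Fraction of +-1 walks started at x0 that stay strictly positive up to time t1."""
    survived = 0
    for _trial in range(trials):
        x = x0
        alive = True
        for _step in range(t1):
            x += -1 if rng.random() < p_down else 1
            if x <= 0:            # tau_0^- has occurred
                alive = False
                break
        if alive:
            survived += 1
    return survived / trials

bound = azuma_bound(10, 0.5, 1.0, 100)                       # exp(-2)
freq = survival_frequency(10, 0.75, 100, 2000, random.Random(0))
```

The empirical frequency should lie well below the bound, since Hoeffding-Azuma is far from tight for such a strongly biased walk.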

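Lemma 2.2. Let $(X_t)_{t \in \mathbb{N}}$ be a process adapted to $(\mathcal{F}_t)_{t \in \mathbb{N}}$ and satisfying the following
conditions for some $a > 2\delta \geq 0$:

(1) $X_{t+1} - X_t \in \{-1, 0, 1\}$.

(2) $\mathbb{E}[X_{t+1} - X_t \mid \mathcal{F}_t] \leq -\delta$.

(3) ${\rm Var}(X_{t+1} \mid \mathcal{F}_t) \geq a$.

(4) $X_0 \leq 0$.

Let $\tau_r^+ = \inf\{t : X_t \geq r\}$. Then

\[ \mathbb{P}(\tau_r^+ \leq t) \leq C_1 \exp\big\{ -C_2 (r/\sqrt{t} + \delta\sqrt{t})^2 \big\} + O(t^{-1/2}), \tag{2.10} \]

where $C_1, C_2$ are positive constants which depend only on $a$.

Proof. It is easy to verify that conditions 1-3 imply

\[ \mathbb{P}(X_{t+1} \neq X_t \mid \mathcal{F}_t) \geq a \ ; \qquad \mathbb{P}(X_{t+1} - X_t = -1 \mid \mathcal{F}_t,\ X_{t+1} \neq X_t) \geq \frac{1}{2} + \frac{\delta}{2a}. \tag{2.11} \]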

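Now, let $T_0 = 0$ and $T_k = \inf\{t > T_{k-1} : X_t \neq X_{T_{k-1}}\}$, $Y_k = X_{T_k}$ for $k \geq 1$. Also, let
$N_k = T_k - T_{k-1}$ and $Z_k = Y_k - Y_{k-1}$. From (2.11), it is not hard to see that we can couple
$(N_k)_k, (Z_k)_k$ with two i.i.d. sequences $(\tilde{N}_k)_k, (\tilde{Z}_k)_k$ such that $N_k \leq \tilde{N}_k$, $Z_k \leq \tilde{Z}_k$ a.s. and

\[ \tilde{N}_k \sim {\rm Geom}(a) \ ; \qquad \tilde{Z}_k = \begin{cases} +1 & \text{w.p. } 1/2 - \delta/2a, \\ -1 & \text{w.p. } 1/2 + \delta/2a. \end{cases} \]

Consequently, with $\tilde{Y}_k = \sum_{m \leq k} \tilde{Z}_m$ and $\tilde{T}_k = \sum_{m \leq k} \tilde{N}_m$ we have: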
P(t+ < t) ^ P(yt,/2 ^ r, Ti,/2 ^ t) 

^ p(yi,/2 ^ - IP(^W2 > 

^ Ci exp |-C2^^^^^| + 0(t-V2) + exp{-C3t} , 

where the last inequality follows by the local CLT for $\tilde Y_k$, which is a nearest-neighbor random walk whose steps have mean $-\delta/a$ and variance $1 - (\delta/a)^2$, and also by Cramér's theorem for $\tilde T_k$. All constants depend only on $a$. □

Finally, we will also make use of the following result from [29]: 

Lemma 2.3 ([29, Proposition 17.20]). Let $(Z_t)_{t \ge 0}$ be a non-negative supermartingale adapted to $(\mathcal{G}_t)_{t \ge 0}$ and $N$ a stopping time. Suppose that:

(1) $Z_0 = z_0$

(2) $|Z_{t+1} - Z_t| \le B$

(3) $\exists \sigma > 0$ such that $\mathrm{Var}(Z_{t+1} \mid \mathcal{G}_t) > \sigma^2$ on the event $\{N > t\}$.

If $u > 4B^2/(3\sigma^2)$, then:

$$\mathbb{P}_{z_0}(N > u) \le \frac{4 z_0}{\sigma\sqrt{u}}.$$

2.4. Variance Lemma. The following is a straight-forward extension of [ , Lemma 2.6] 
to vector valued Markov processes. We include a proof here for completeness. 

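Lemma 2.4. Let $(Z_t)$ be a Markov chain taking values in $\mathbb{R}^d$ and with transition matrix $P$. Write $\mathbb{P}_{z_0}$, $\mathbb{E}_{z_0}$ for its probability measure and expectation respectively, when $Z_0 = z_0$. Suppose that there is some $0 < \eta < 1$ such that for all pairs of starting states $(z_0, \tilde z_0)$,

$$\|\mathbb{E}_{z_0} Z_t - \mathbb{E}_{\tilde z_0} Z_t\|_2 \le \eta^t \|z_0 - \tilde z_0\|_2 \tag{2.12}$$

Then $v_t = \sup_{z_0} \mathrm{Var}_{z_0}(Z_t) = \sup_{z_0} \mathbb{E}_{z_0}\|Z_t - \mathbb{E}_{z_0} Z_t\|_2^2$ satisfies:

$$v_t \le v_1 \min\{t, (1 - \eta^2)^{-1}\}. \tag{2.13}$$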

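Proof. Let $Z_t$ and $Z_t^*$ be independent copies of the chain with the same starting state $z_0$. By the assumption (2.12), we obtain that

$$\|\mathbb{E}_{z_0}[Z_t \mid Z_1 = z_1] - \mathbb{E}_{z_0}[Z_t^* \mid Z_1^* = z_1^*]\|_2 = \|\mathbb{E}_{z_1}[Z_{t-1}] - \mathbb{E}_{z_1^*}[Z_{t-1}^*]\|_2 \le \eta^{t-1}\|z_1 - z_1^*\|_2.$$

Hence, we see that

$$\mathrm{Var}_{z_0}(\mathbb{E}_{z_0}[Z_t \mid Z_1]) = \tfrac12 \mathbb{E}_{z_0}\|\mathbb{E}_{Z_1}[Z_{t-1}] - \mathbb{E}_{Z_1^*}[Z_{t-1}^*]\|_2^2 \le \tfrac12 \eta^{2(t-1)} \mathbb{E}_{z_0}\|Z_1 - Z_1^*\|_2^2 \le \eta^{2(t-1)} v_1.$$

Combined with the total variance formula, it follows that

$$v_t \le \sup_{z_0}\{\mathbb{E}_{z_0}[\mathrm{Var}_{z_0}(Z_t \mid Z_1)] + \mathrm{Var}_{z_0}(\mathbb{E}_{z_0}[Z_t \mid Z_1])\} \le v_{t-1} + \eta^{2(t-1)} v_1,$$

which then gives that $v_t = \sum_{i=1}^{t}(v_i - v_{i-1}) \le \sum_{i=1}^{t} \eta^{2(i-1)} v_1$, implying the desired upper bound immediately. □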

2.5. Bottleneck Ratio. Let $P$ be an irreducible, aperiodic transition kernel for a Markov chain on $S$ with stationary measure $\pi$. The bottleneck ratio of a set $A \subset S$ is:

$$\Phi(A) = \frac{\sum_{x \in A,\, y \notin A} \pi(x) P(x,y)}{\pi(A)} \le \frac{\pi(\partial_P A)}{\pi(A)},$$

where $\partial_P A = \{x \in A : P(x,y) > 0 \text{ for some } y \notin A\}$. The bottleneck ratio of the chain is

$$\Phi_* = \min_{A :\, \pi(A) \le \frac12} \Phi(A). \tag{2.14}$$

The following result, due to [i ,27,38] in several similar forms (see, e.g., [29, Theorem 7.3]) 
relates the bottleneck ratio with the mixing time of the chain. 

Theorem 2.5. If $\Phi_*$ is the bottleneck ratio defined in (2.14) then $t_{\mathrm{mix}}(1/4) \ge \frac{1}{4\Phi_*}$.
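As an illustration of the definition (2.14) and of the bound in Theorem 2.5, the following minimal Python sketch computes the bottleneck ratio of a small chain by brute force over all subsets. The function name `bottleneck_ratio` and the two-state example chain are hypothetical and are not taken from the paper; the stationary measure is passed in rather than computed.

```python
from itertools import combinations


def bottleneck_ratio(P, pi):
    """Brute-force Phi_* = min over nonempty A with pi(A) <= 1/2 of
    sum_{x in A, y not in A} pi(x) P(x, y) / pi(A).
    Only feasible for very small state spaces (2^n subsets)."""
    n = len(pi)
    best = float("inf")
    for size in range(1, n):
        for A in combinations(range(n), size):
            mass = sum(pi[x] for x in A)
            if mass > 0.5 + 1e-12:
                continue  # only sets with stationary mass at most 1/2
            inside = set(A)
            flow = sum(pi[x] * P[x][y]
                       for x in A for y in range(n) if y not in inside)
            best = min(best, flow / mass)
    return best


# Hypothetical example: symmetric two-state chain flipping with probability p.
p = 0.1
P = [[1 - p, p], [p, 1 - p]]
pi = [0.5, 0.5]
phi_star = bottleneck_ratio(P, pi)
mixing_lower_bound = 1.0 / (4.0 * phi_star)  # lower bound of Theorem 2.5
```

For this toy chain the only admissible sets are the two singletons, so the bottleneck ratio equals the flip probability.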

3. Drift Analysis for the Proportions Chain 



In this section we prove various results concerning the drift of the process $S_t$. We analyze both the one coordinate process $S_t^1$ and the distance-to-equiproportionality $\|S_t - e\|_2$. In the course of this analysis, we also define two couplings which will be of independent use later on and prove a uniform bound on the variance of $S_t$.

3.1. The Drift of One Proportion Coordinate. From symmetry, it is enough to analyze the drift of $S_t^1$. For $\beta \ge 0$, define $g_\beta : S \to S$ as

$$g_\beta(s) = (g_\beta^1(s), \ldots, g_\beta^q(s)), \qquad g_\beta^k(s) = \frac{\exp(2\beta s^k)}{\sum_{j=1}^{q} \exp(2\beta s^j)}.$$

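We can express the drift of $S_t^1$ as follows:

$$\mathbb{E}[S_{t+1}^1 - S_t^1 \mid \mathcal{F}_t] = \frac{1}{n}\Big(-S_t^1 + \sum_{k=1}^{q} g_\beta^1\big(S_t - \tfrac{1}{n} e_k\big) S_t^k\Big) = \frac{1}{n}\big[-S_t^1 + g_\beta^1(S_t)\big] + O(n^{-2}) = \frac{1}{n} d_\beta(S_t) + O(n^{-2}) \tag{3.1}$$

with

$$d_\beta(s) = -s^1 + g_\beta^1(s). \tag{3.2}$$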

The function $d_\beta : S \to \mathbb{R}$ thus describes (up to a constant factor of $1/n$ and an error term) the drift of the first coordinate given the current proportions vector. It turns out that rapid mixing hinges on whether $d_\beta(s)$ is strictly negative whenever $s^1 > 1/q$ (and for any values for the remaining coordinates of $s$). Accordingly we define $D_\beta : [0, 1] \to \mathbb{R}$ as

$$D_\beta(x) = \max_{s \in S :\, s^1 = x} d_\beta(s) = d_\beta\big(x, \tfrac{1-x}{q-1}, \ldots, \tfrac{1-x}{q-1}\big) \tag{3.3}$$

$$= -x + \frac{\exp(2\beta x)}{\exp(2\beta x) + (q-1)\exp\big(2\beta \tfrac{1-x}{q-1}\big)} \tag{3.4}$$

and check when $D_\beta(x)$ is strictly negative for all $x \in (1/q, 1]$. We will see in Proposition 3.1 below that this happens if and only if $\beta < \beta_s(q)$, where $\beta_s(q) > 0$ is defined in (1.1).
For $\beta \ge 0$, define $s^*(\beta)$ and $s^\dagger(\beta)$ as:

$$s^*(\beta) = \sup\{s \in [1/q, 1) : \tfrac{d}{ds} D_\beta(s) = 0\}$$

$$s^\dagger(\beta) = \inf\{s \in (1/q, 1) : D_\beta(s) \ge 0\},$$

with the inf or sup being 1, if the respective sets are empty. We may now state:

Proposition 3.1. For all $q \ge 3$ the following holds:

(1) We have that $D_\beta(s)$ is increasing in $\beta$ if $s \in [1/q, 1]$ and for all $\beta \ge 0$, that $D_\beta(1/q) = 0$ and that $D_\beta(1) < 0$.

(2) That $s^\dagger(\beta) > 1/q$ if $\beta < q/2$.

(3) The following statements are equivalent if $\beta < q/2$:

(a) $s^\dagger(\beta) = 1$.

(b) $D_\beta(s)$ has no roots in $(1/q, 1]$.

(c) $D_\beta(s^*(\beta)) < 0$.

(4) All the statements in part (3) hold if and only if $\beta < \beta_s(q)$.



Proof. We start by proving part (1). It is clear that

$$D_\beta(s) = -s + \frac{1}{1 + (q-1)\exp\big(2\beta \tfrac{1-qs}{q-1}\big)}$$

and hence is strictly increasing in $\beta$ if $1/q < s \le 1$. Furthermore, we have that

$$D_\beta(1/q) = -\frac{1}{q} + \frac{e^{2\beta/q}}{e^{2\beta/q} + (q-1)e^{2\beta/q}} = -\frac{1}{q} + \frac{1}{q} = 0,$$

as well as

$$D_\beta(1) = -1 + \frac{e^{2\beta}}{e^{2\beta} + (q-1)} < 0.$$

For part (2), taking the derivative of $D_\beta(s)$ with respect to $s$ evaluated at $s = 1/q$ one obtains $\frac{d}{ds} D_\beta(s)\big|_{s=1/q} = -1 + \frac{2\beta}{q}$, which is negative if $\beta < q/2$. Together with $D_\beta(1/q) = 0$, this completes the proof.

For part (3), we first show that there exist at most two points in $(1/q, 1]$ such that $\frac{d}{ds} D_\beta(s) = 0$. To see this, we compute the first derivative and obtain that

$$\frac{d}{ds} D_\beta(s) = -1 + \frac{2q\beta \exp\big(2\beta \tfrac{1-qs}{q-1}\big)}{\big[1 + (q-1)\exp\big(2\beta \tfrac{1-qs}{q-1}\big)\big]^2} = -1 + \frac{2q\beta}{q-1}\, h\Big((q-1) e^{2\beta \frac{1-qs}{q-1}}\Big),$$

where $h(x) = \frac{x}{(1+x)^2}$. Obviously, there are at most two zeros for $-1 + \frac{2q\beta}{q-1} h(x)$, and since $(q-1)\exp\big(2\beta \tfrac{1-qs}{q-1}\big)$ is strictly monotone in $s$, we conclude that there are at most two points such that $\frac{d}{ds} D_\beta(s)$ vanishes.

Notice also that $\frac{d}{ds} D_\beta(1/q) < 0$ provided that $\beta < q/2$ and hence $D_\beta(1/q + \xi) < 0$ for all $0 < \xi \le \xi_0$, where $\xi_0$ is a sufficiently small positive number. We are now ready to derive the equivalence stated in the proposition. Observing that $D_\beta(s)$ is a smooth function and $D_\beta(1) < 0$, we deduce that (3a) $\Leftrightarrow$ (3b) $\Rightarrow$ (3c). It remains to prove that (3c) $\Rightarrow$ (3a). Suppose now that (3c) holds and there exists $s_0 \in (1/q, 1]$ such that $D_\beta(s_0) \ge 0$. Recalling that $D_\beta(1/q + \xi) < 0$ and $D_\beta(1) < 0$, we deduce the following:

• If $s_0 < s^*$, we will then have at least two zeros in $(1/q, s^*)$ for $\frac{d}{ds} D_\beta(s)$.

• If $s_0 > s^*$, we will then have at least one zero in $(s^*, 1)$ for $\frac{d}{ds} D_\beta(s)$.

We see that the first case contradicts the fact that there can be at most two zeros for $\frac{d}{ds} D_\beta(s)$ and the second case contradicts our definition of $s^*$. Altogether, we established that (3c) $\Rightarrow$ (3a).

As for the last part, continuity and part (1) imply that (1.1) is equivalent to

$$\beta_s = \sup\{\beta \ge 0 : D_\beta(s) < 0 \text{ for all } s \in (1/q, 1]\}.$$

Since $D_\beta(s)$ is increasing in $\beta$ for all $s \in (1/q, 1]$, it follows that for all $\beta < \beta_s(q)$, we indeed have $D_\beta(s) < 0$ for all $s \in (1/q, 1]$. On the other hand, by the continuity of the function $D_\beta(s)$ and our definition of $\beta_s$, we know that there exists $s_M \in (1/q, 1]$ such that $D_{\beta_s}(s_M) = 0$. Now, using the result of part (1) we conclude that $D_\beta(s_M) > 0$ for any $\beta > \beta_s$, completing the proof. □

In the following proposition we discuss the relation between $\beta_s(q)$ and $\beta_c(q)$.

Proposition 3.2. For $q \ge 3$ we have that $0 < \beta_s(q) < \beta_c(q) < q$ while $\beta_s(2) = \beta_c(2) = 1$.

Proof. As recalled in Subsection 2.2,

$$\beta_c(2) = 1 \quad \text{and} \quad \beta_c(q) = \frac{(q-1)\log(q-1)}{q-2} ; \quad q \ge 3.$$

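In the $q = 2$ case, it is easy to verify that $\frac{d}{ds} D_\beta(s) < 0$ for all $s \in [\frac12, 1]$ if $\beta < 1$ and $\frac{d}{ds} D_\beta(\frac12) > 0$ if $\beta > 1$. Since in addition $D_\beta(\frac12) = 0$, $D_\beta(1) < 0$, we obtain $\beta_s(2) = 1$.

For $q \ge 3$ we have $\beta_c(q) < q/2$ and therefore

$$D_{\beta_c}\big(\tfrac{q-1}{q}\big) = -\frac{q-1}{q} + \frac{1}{1 + (q-1)\exp\big(-2\beta_c \tfrac{q-2}{q-1}\big)} = 0$$

and

$$\frac{d}{ds} D_{\beta_c}(s)\Big|_{s = \frac{q-1}{q}} = -1 + \frac{2q\beta_c \exp\big(-2\beta_c \tfrac{q-2}{q-1}\big)}{\big[1 + (q-1)\exp\big(-2\beta_c \tfrac{q-2}{q-1}\big)\big]^2} = \frac{2(q-1)\log(q-1) - q(q-2)}{q(q-2)}.$$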

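Now if $\varphi(q) = 2(q-1)\log(q-1) - q(q-2)$ then $\varphi(2) = \varphi'(2) = 0$ and $\varphi''(q) = -2 + \frac{2}{q-1}$, which is negative when $q > 2$. It follows that $\varphi(q)$ is negative when $q > 2$ and hence for $q \ge 3$,

$$\frac{d}{ds} D_{\beta_c}(s)\Big|_{s = \frac{q-1}{q}} < 0.$$

So for small enough $\epsilon > 0$ and $s \in \big(\tfrac{q-1}{q} - \epsilon, \tfrac{q-1}{q}\big)$ we have that $D_{\beta_c}(s) > 0$ and hence $\sup_s D_{\beta_c}(s) > 0$. By the smoothness of $D_\beta(s)$ this implies that there exists $\beta < \beta_c$ such that $\sup_s D_\beta(s) > 0$, which establishes that $\beta_s < \beta_c$. □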
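The characterization of $\beta_s(q)$ as the supremum of those $\beta$ for which $D_\beta(s) < 0$ on $(1/q, 1]$, together with the monotonicity of $D_\beta$ in $\beta$, suggests a simple numerical approximation by bisection. The sketch below is only a rough illustration: the helper names, the grid resolution and the bisection bracket $[0, q]$ are assumptions made here, and checking negativity on a finite grid only approximates the true threshold.

```python
import math


def D(beta, q, x):
    """D_beta(x) as in (3.4), written in the equivalent form
    -x + 1 / (1 + (q-1) exp(2 beta (1 - q x) / (q - 1)))."""
    e = math.exp(2.0 * beta * (1.0 - q * x) / (q - 1.0))
    return -x + 1.0 / (1.0 + (q - 1.0) * e)


def has_positive_point(beta, q, grid=4000):
    """True if D_beta > 0 at some point of a uniform grid on (1/q, 1]."""
    lo = 1.0 / q
    return any(D(beta, q, lo + (1.0 - lo) * k / grid) > 0.0
               for k in range(1, grid + 1))


def beta_s(q, iters=50):
    """Bisection for the largest beta with D_beta <= 0 on the grid.
    Relies on D_beta(s) being increasing in beta (Proposition 3.1)."""
    lo, hi = 0.0, float(q)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if has_positive_point(mid, q):
            hi = mid
        else:
            lo = mid
    return lo


def beta_c(q):
    """Critical value (q-1) log(q-1) / (q-2) for q >= 3."""
    return (q - 1.0) / (q - 2.0) * math.log(q - 1.0)


bs2 = beta_s(2)   # should be close to beta_s(2) = 1
bs3 = beta_s(3)   # should lie strictly below beta_c(3)
bc3 = beta_c(3)
```

In this sketch the case $q = 2$ serves as a sanity check against Proposition 3.2, and for $q = 3$ the computed value can be compared with $\beta_c(3)$.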

We will make repeated use of the following proposition throughout the paper.

Proposition 3.3. 

(1) Assume (3 < q/2. For allO < pQ < p small enough, there exists 7 > and C,c> 
such that for all n with t = e'^" we have 

(30 ^ s ^ t : as^ ^ Ce"^'' (3.5) 

for all ctq G Sn° ■ 

(2) Assume (3 < q/2. For all ro > 0, 7 > there exists C,c > such that for all n 
and r > ro with t = jn, Po = ^ and p = we have 

P^o (30 ^ s ^ t : (Jsi ^ Ce-^'''' (3.6) 

for allao G 

(3) Assume (3 < f3s{q). For all p > there exists 7 > and C, c > 0, such that for all 
n, with t = 772 we have 

P., {at i ^ Ce— (3.7) 

for all (To G S„. 

Proof. Consider the process (5j — Sq • ^ ^ 0) until the first time it is to the right of l/q-\-p. 
If n is large enough and, in Case ( ), if p is small enough, Proposition 3.1 Part (1) and 
Eq. (-i.l) imply that this process satisfies the conditions of Lemma '2,1 . Parts (I) and (2) 
of the proposition then follow from Parts ( ) and ( - ) of the lemma and summing over all 
coordinates. 

As for part (3), consider this time (5^ — 1/q + p/2 : t ^ 0). Since (3 < (3s{q), it follows 
from Proposition 3.1 (Parts (1) and (3)) that this process also satisfies the conditions of
Lemma 2.1, with $\delta > C' n^{-1}$, for some positive constant $C'$. Now set $\gamma = 2/C'$ and apply
part ( I ) of the Lemma to conclude that except with probability exponentially small in n, 
5*4 ^ 1/9 + p/2 for some t ^ 7n. Once this happens, by Lemma Part ( ), as in the 
proof of {■'>■■->), we have S}^^ ^ l/q + p again except with probability tending exponentially 
fast to zero with n. It remains to use union bound to complete the proof. □ 

3.2. Bounded Dynamics. The bounded dynamics is a process that evolves like at, only 
that S{at) is forced to stay close to e by rejecting transitions which violate this condition. 
Formally, fix p > and let (cr^)^ ^ q be a Markov chain on Sn"*", which evolves as follows. 
Start from some do G Sn^ and at step t + 1: 

• Draw at+i according to Pn{(7t, •), where Pn is the original transition kernel. 

• If (Tt+i € Sn"*" set at+i = a^+i and otherwise set at+i = at- 

We shall denote by P'' the underlying probability measure. 
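The rejection rule above can be sketched at the level of color counts, which is enough to follow the proportions vector. This is a minimal illustration under stated assumptions: the one-step rule (old color drawn proportionally to the counts, new color drawn from $g_\beta$ evaluated after removing the chosen vertex) follows the transition probabilities used later in Section 3.5, while the specific form of the allowed region (every proportion strictly below $1/q + \rho$) and all function names are assumptions made here for the example.

```python
import math
import random


def g(beta, s):
    """Softmax-type map g_beta(s)^k = exp(2 beta s^k) / sum_j exp(2 beta s^j)."""
    m = max(s)
    w = [math.exp(2.0 * beta * (x - m)) for x in s]  # shift for stability
    z = sum(w)
    return [x / z for x in w]


def step(counts, beta, rng, rho=None):
    """One Glauber step on the vector of color counts.
    If rho is given, a move leaving the (assumed) region
    {max_k s^k < 1/q + rho} is rejected, mimicking the bounded dynamics."""
    n, q = sum(counts), len(counts)
    i = rng.choices(range(q), weights=counts)[0]      # old color of a uniform vertex
    rest = counts[:]
    rest[i] -= 1                                      # remove the chosen vertex
    probs = g(beta, [c / n for c in rest])            # g_beta(S_t - e_i / n)
    j = rng.choices(range(q), weights=probs)[0]       # new color
    proposal = rest[:]
    proposal[j] += 1
    if rho is not None and max(c / n for c in proposal) >= 1.0 / q + rho:
        return counts                                 # reject: stay put
    return proposal


# Hypothetical usage: q = 3 colors, n = 300 vertices, balanced start.
rng = random.Random(0)
state = [100, 100, 100]
for _ in range(1000):
    state = step(state, beta=1.0, rng=rng, rho=0.05)
```

Running the same function with `rho=None` gives the unbounded chain, so the two versions can be driven by the same random source until the first rejected move.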

The unbounded and p-bounded dynamics admit a natural coupling, under which the two 
processes start from the same configuration and evolve together until time r = inf{t ^ : 
St ^ 5''"''}, where St is the unbounded process. This leads to the following two immediate 
observations which will be useful later. 

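(1) For any integer $t$ and bounded function $f : \Sigma_n^{[0,t]} \to \mathbb{R}$:

$$|\mathbb{E} f(\sigma_{[0,t]}) - \mathbb{E}^\rho f(\sigma^\rho_{[0,t]})| \le 2\|f\|_\infty \mathbb{P}(\tau \le t). \tag{3.8}$$

(2) In particular for any set $A \subset \Sigma_n^{[0,t]}$:

$$|\mathbb{P}(\sigma_{[0,t]} \in A) - \mathbb{P}^\rho(\sigma^\rho_{[0,t]} \in A)| \le 2\mathbb{P}(\tau \le t) \tag{3.9}$$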

3.3. Synchronized Coupling. The synchronized coupling is a (Markov) coupling of two $\rho$-bounded dynamics in which the two chains "synchronize" their steps as much as possible. Formally, define $(\sigma_t)_{t \ge 0}$, $(\tilde\sigma_t)_{t \ge 0}$ on the same probability space such that starting from $\sigma_0, \tilde\sigma_0$, at time $t + 1$:

(1) Choose colors $I_{t+1}, \tilde I_{t+1}$ according to an optimal coupling of $S_t, \tilde S_t$.

(2) Choose colors $J_{t+1}, \tilde J_{t+1}$ according to an optimal coupling of $g_\beta(S_t - n^{-1} e_{I_{t+1}})$, $g_\beta(\tilde S_t - n^{-1} e_{\tilde I_{t+1}})$.

(3) Change a uniformly chosen vertex of color $I_{t+1}$ in $\sigma_t$ to have color $J_{t+1}$ in $\sigma_{t+1}$, but only if $\sigma_{t+1} \in \Sigma_n^{\rho+}$.

(4) Change a uniformly chosen vertex of color $\tilde I_{t+1}$ in $\tilde\sigma_t$ to have color $\tilde J_{t+1}$ in $\tilde\sigma_{t+1}$, but only if $\tilde\sigma_{t+1} \in \Sigma_n^{\rho+}$.

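We shall write $\mathbb{P}^{SC,\rho}_{\sigma_0,\tilde\sigma_0}$ for the underlying measure and omit $\rho$ if it is large enough for the dynamics not to be bounded.

The following shows that this coupling contracts the $\|\cdot\|_1$ distance of the proportions vector.

Lemma 3.4. There exists $C(\beta, q) > 0$ such that for any $\rho > 0$, uniformly in $\sigma_0, \tilde\sigma_0 \in \Sigma_n^{\rho+}$ as $n \to \infty$

$$\mathbb{E}^{SC,\rho}_{\sigma_0,\tilde\sigma_0} \|S_t - \tilde S_t\|_1 \le \Big(\varrho + \frac{C\rho}{n}\Big)^t \|s_0 - \tilde s_0\|_1, \tag{3.10}$$

where

$$\varrho = \varrho(n, \beta, q) = 1 - \frac{1 - 2\beta/q}{n}. \tag{3.11}$$

Proof. For $s, \tilde s \in S_n$, by a Taylor expansion of $g_\beta$ around $s$, then another expansion for $\nabla g_\beta$ around $e$, one has:

$$\|g_\beta(s) - g_\beta(\tilde s)\|_1 = \frac{2\beta}{q} \|s - \tilde s\|_1 \big(1 + O(\|s - e\|_1 + \|\tilde s - e\|_1)\big),$$

where we use the easily verified:

$$\frac{\partial g_\beta^k}{\partial s^j} = \begin{cases} -2\beta g_\beta^k g_\beta^j & k \ne j \\ -2\beta (g_\beta^k)^2 + 2\beta g_\beta^k & k = j. \end{cases} \tag{3.12}$$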

Now under the bounded dynamics, It+i ^ h+i implies that Sl*^^ > 5*/*^^ and -S/*^^ > 
S/*"*"^ while Jt+i / Jj+i implies that (5*4 — n~-'^e/j_|_^) "''^^ > (^St — n^^ej^^^^ and 

(^St — n^^ej^^^^ > (5t — n^^e/j^J'^*"'"\ It follows by the definition of the coupling 
that 

\\St+i — 'S't+illi — \\St — St\\i = — 

n 

Recalling that for $s \in S_n$, $\|s\|_{TV} = \frac12 \|s\|_1$ and that under the best coupling of distributions $s, \tilde s$ the probability of disagreement is $\|s - \tilde s\|_{TV}$, we have:

E^^f \\Si - Si\\, 

0-0,0-0 II ^ ^ll'^ 

2 

< \\so - solli + - (-^llso - solli + 5 \\ai3{so) - 5/3(so)|li + 0{n'^)) 

^ So-so lU hO{n _ 

\ n 

The result follows by iteration. □ 
3.4. Uniform Variance Bound. 

Lemma 3.5. Assume (3 < q/2. There exists po = Po{(3,q) such that if p ^ po 

Var^^ (St) = O (n-i) , (3.13) 

uniformly in ctq G and t ^ 0, and there exists 70 > such that 

Var^o {St) = O (n-i) , (3.14) 

uniformly in ctq G S^"^ and t ^ e'^o". 

Proof. Equation (3.13) will follow directly from Lemma 2.4 applied to $S_t$ under the $\rho$-bounded dynamics. Indeed, Lemma 3.4 gives a stronger version of Condition (2.12) with $\eta = \varrho + \rho\, O(n^{-1})$. Now, if $\beta < q/2$ and $\rho$ is small enough, we have $\eta < 1$ for large enough $n$. Then (3.13) follows from (2.13) since $\mathrm{Var}^\rho_{\sigma_0} S_1 = O(n^{-2})$.

For (3.14), find p' < p and use ( ■ ,) and (■-. •) to conclude that for all G and 
t ^ e'''" for some 7 = 7(p, p'): 

Var,„ {St) = Var^^ {St) + o{n-^) = O (n-^) . □ 
Corollary 3.6. For $\beta < \beta_c(q)$, we have $\mathrm{Var}_{\mu_n}(S) = O(n^{-1})$.

Proof. Fix $\rho < \rho_0$, where $\rho_0$ is given in Lemma 3.5 and notice that the bounded dynamics is reversible with respect to the Potts measure $\mu_n$ restricted to $\Sigma_n^{\rho+}$. Therefore the bounded dynamics has $\mu_n^\rho(\cdot) = \mu_n(\cdot \mid \sigma \in \Sigma_n^{\rho+})$ as its stationary measure. From the large deviation analysis in Subsection 2.2 it is straightforward to conclude that if $\beta < \beta_c(q)$



-Cn 



for some C > and n large enough, depending on p and /3. Therefore, we have \\pr, 



PnW-T-v ^ e . Since pn, Pn live on a compact space, this gives Var^„(5) ^ Var^p(S') + 
e Since Pctq(<T4 G •) converges to pn a-s t — ^ oo for any fixed G En"*", Lemma 3.5 can 
be extended to ao chosen from pn- This completes the proof. □ 



3.5. The Drift of the Distance to Equiproportionality. Here we show that $S_t$ has
drift towards 0. Write St+i = St + Ct+i where we have that for j = 1, g, 



IP (6+1 = ^ (ej - e.)) = Slg^ {St - '-e,) = Sfg^^ {St) + O (n"!) . 



Then, 



E\\\st+i-m\st] 



E\\\St-e + ^t+i\\l\St] 



\St-ef,+E[Ut+i 



St]+2{E [Ct+i\St],St) 



^2 9 p'^P^l 

lsiii + ^(i-Ma)) + -E^r^,.f 

j = l L^k=l 



Si--\\St\\l + 0{n-^) 



\St\\l{l-^) + (2h{S 



n 



+ {2-2h{St))n-^ + 0{n-^) , 



where h{s) = X]fc=i fl'^(^)^'^- Notice that since g^{s) = ^^(s — e) we have that h{s) 
1/q + h{'s) and its gradient and Hessian are: 

1 



Di/i(0) = - and D2/i(0) 



-P. 



where P is a projection matrix onto (e) . Therefore, we may write 



h{s) 



+ 



2/3, 



+ 0{\\sf,) 



This gives 



E 



\s 



t+llbl-^t 



where p is defined in { 



\St"' 



2(l-2/3/g) 



n 



n 



-1/ 



0(\\St\\l)+0{n-^) 



St\\y + n-'0[\\St\\t]+0{n 



(3.15) 



3.6. Contraction for the Distance to Equiproportionality. Fix $\beta < q/2$ and $\rho < \rho_0$

where po is given in Lemma 3.5. For what follows, assume that ctq G and t ^ e"^"" 
where 70 is also given. Then, taking expectation in equation ( • ), we get: 



E^,\\St+ih = P^E^oWStf, + (^^oWStWf) O + O (n-2) . 



(3.16) 
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Now by Taylor expansion of s i— ||s||2 around Eo-gSf in view of (3,14), 
E^JI^tll? = m^JtWo + lE.n /DiII • ||3 (E^Jt) , St - E^Jt) + O (E^JSt - E^Jt 



+ O (n-1) = (E^,\\St\\2 + O K'))'^' + O (n-i) 

N 3/2 



Then 



>. J 

St+iW'i =p2E,o||5t||2 + (E,J|^t||2)'^'o(n-i)+0(n-2) . (3.17) 



This will in turn imply: 

Proposition 3.7. Fix (3 < q/2. There exist po = Po(/3,9) > and C = C{f3,q) > such 
that if p ^ po there exists jip) > such that: 

^aoWStWl = P^WlMl + Cp^) + O (n-i) , (3.18) 

uniformly in ctq G Sn"*" and t ^ e'^^''-'", where p = p{n, /3, q) is defined in 3.11. 

Proof. Set A( = Eo-o||5f||2. It follows from ( ) and (3.5) that for any p < p < pQ, where 
po is given in Lemma ->. '>, there exists 7 = 7(p,p) > such that uniformly in ctq € Tif^ 
and t ^ e^": 

Xt+i ^ Xt + pO (n-1)) + O (n-2) = {p + pO (n^^))' + O {n~^) . 

We now use the following fact, which can be easily verified. If $(\lambda_t)_{t \ge 0}$ is a sequence satisfying:

$$\lambda_{t+1} = p\lambda_t + a r^t + b,$$

for some $p \ne r$, $p \ne 1$, $a$ and $b$, then:

$$\lambda_t = \lambda_0 p^t + a\,\frac{p^t - r^t}{p - r} + b\,\frac{1 - p^t}{1 - p}. \tag{3.19}$$
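The closed form for the linear recursion $\lambda_{t+1} = p\lambda_t + a r^t + b$ can be checked numerically. The following sketch assumes the solution reads $\lambda_t = \lambda_0 p^t + a (p^t - r^t)/(p - r) + b (1 - p^t)/(1 - p)$ and compares it with direct iteration; the function names and parameter values are illustrative only.

```python
def iterate(lam0, p, a, r, b, t):
    """Apply lambda_{k+1} = p * lambda_k + a * r**k + b for k = 0, ..., t-1."""
    lam = lam0
    for k in range(t):
        lam = p * lam + a * r ** k + b
    return lam


def closed_form(lam0, p, a, r, b, t):
    """Assumed closed form of the recursion (requires p != r and p != 1)."""
    return (lam0 * p ** t
            + a * (p ** t - r ** t) / (p - r)
            + b * (1.0 - p ** t) / (1.0 - p))


# Illustrative parameters (not from the paper).
params = dict(lam0=2.0, p=0.9, a=0.5, r=0.7, b=0.01)
gap = max(abs(iterate(t=t, **params) - closed_form(t=t, **params))
          for t in range(30))
```

Setting `a = 0` recovers the geometric-series case used in the first application of the fact below.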

Apply this (with a = 0) and use the monotonicity in Aj of the right hand side above (at 
least if n is large enough), to conclude: 

A, ^ Ao {p^-pO{n-')T + (^'') <.Cf{p^-pO{n-')r^O{n-') . 

Plugging this a priori bound back into (3.17) we see that: 

At+i = /A, + p3 (p + -pO{n-^)f O (n-i) + O (n'^) . 

Using (3.19) again and choose p small enough to obtain 

= (Ao + 0(p3))^2t^Q(^-i) _ 
as desired. □ 

4. Mixing in the Subcritical Regime

In this section we prove Theorem 1. Recall that $\alpha_1 = \alpha_1(\beta, q) = [2(1 - 2\beta/q)]^{-1}$ and

$$t^{\alpha_1}(n) = \alpha_1 n \log n ; \qquad t^{\alpha_1}_\gamma(n) = \alpha_1 n \log n + \gamma n. \tag{4.1}$$

4.1. Proof of Lower Bound in Theorem .

Proof. The analysis in this subsection pertains to all f3 < (3c{q)- Fix < p2 < Pi < POi 
where po is given in Proposition 3.7 and let do G S„ be such that p2 < ||sb||2 < Pi- Then 
if t < t"^(n) and pi is small enough, Proposition 3.7 implies 

for sufficiently large —7 depending on p2 and large enough n. Combined with the uniform 
variance bound given in Lemma ..>,•), it follows that for large enough n 

-(1-2/3/9)7/2 



n 



Applying Chebyshev's inequality and using Lemma 3.5 again, we conclude that uniformly 
in all r > 0, t ^ t^^{n) and do G \ 2^^+ 



^\ ^ Yai^oiSt) ^ nrrp-(i-2/3/g)7/2 
\/n J 

In particular, this implies 



5*11. < ^ j ^ . ,-a-i;;::r . .2 = o( (e-^-^'^/"^-/^ - 0- ) . (4.2) 



lim limsupP^o ( Il^t"ifn)ll2 < ^ ) = 0- (4-3) 

On the other hand, IE/x„5't = e and from Corollary 3.6 it follows that Yar^^St = O {n^^) 
for (3 < f3c{q)- Therefore another application of Chebyshev's inequality yields that 

Pn(\M.<^) ^1-^, (4.4) 



for all t ^ 0. Altogether, we have that for any r > 0, 

lim liminf d,«i/„s,(n) ^ 1 — ^^}^ 

and it remains to send r — ?■ 00. □ 

In the remainder of the section we prove the upper bound on the mixing time when $\beta < \beta_s(q)$. The proof is based on upper bounding the coalescence time of two coupled dynamics, one starting from any configuration in $\Sigma_n$ and the other starting from the stationary distribution $\mu_n$. This coupling will be done in several stages with different couplings from one stage to the next. In what follows, $(\sigma_t)_{t \ge 0}$ and $(\tilde\sigma_t)_{t \ge 0}$ will denote the two coupled processes.

4.2. $O(n^{-1/2})$ from Coalescence. We now show that with arbitrarily high probability, $S_t$ gets $O(n^{-1/2})$-close to $e$ in $O(n \log n)$ steps, if initially its distance is at most $\rho$, where $\rho$ is small enough. More precisely,

Lemma 4.1. Fix f3 < q/2. Then for all r > 0; 

uniformly in G Sn'^ where po = Po(/3, ^z) is defined in Proposition -.' 7 and i"i(n) is 
defined in { . ) . 
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Proof. This follows immediately from Proposition and a first moment argument: 

JEcrol|'S'^"l(n)l|2 (lEo-o||5'tai(n)||3) ^ _ m 
^ -1/2 ^ -1/2 ~*^\rj- '-' 

rn rn 

4.3. $O(n^{-1})$ from Coalescence. To get the correct order of the mixing time it is not sufficient to simply use the drift to couple the chains, as the drift is very weak when $S_t$ is close to $e$. As such, in this section we define a different coupling of the dynamics which will bring $\sigma_t$ and $\tilde\sigma_t$ to distance $O(n^{-1})$ apart in linear time. This will be achieved one coordinate after the other. We begin by giving a general definition of what we call a semi-independent coupling and then use it to define the coupling of the dynamics.

Let $\nu, \tilde\nu$ be two positive distributions on $\Omega_m = [1, m]$ and fix a non-empty $A \subset \Omega_m$, where $m$ is some positive integer. We shall write $\nu|_A$ for the conditional distribution given $A$, i.e. $\nu|_A(x) = \nu(x)/\nu(A)$, for $x \in A$. The $A$-semi-independent coupling of $(\nu, \tilde\nu)$ is a coupling of two random variables $X$ and $\tilde X$ with underlying measure $\mathbb{P}^*$, constructed according to the following procedure:

(1) Choose $U \in [0, 1]$ uniformly.

(2) If $U \le \min\{\nu(A), \tilde\nu(A)\}$, draw $X$ and $\tilde X$ using a best coupling of $(\nu|_A, \tilde\nu|_A)$.

(3) Otherwise, independently:

(a) Draw $X$ according to $\nu|_A$ if $U < \nu(A)$ and according to $\nu|_{A^c}$ if $U \ge \nu(A)$.

(b) Draw $\tilde X$ according to $\tilde\nu|_A$ if $U < \tilde\nu(A)$ and according to $\tilde\nu|_{A^c}$ if $U \ge \tilde\nu(A)$.

Clearly an $\Omega_m$-semi-independent coupling is a best coupling and for $A = \emptyset$, we define the $\emptyset$-semi-independent coupling to be the standard independent coupling. The following proposition states a few properties of this coupling, which will be useful for the sequel.
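The three-step procedure above can be made concrete by computing the joint law of $(X, \tilde X)$ exactly, which makes the marginals easy to check. The sketch below is an illustration under assumptions: states are indexed from 0, the "best coupling" is realized by one standard maximal-coupling construction, and all function names are hypothetical.

```python
def best_coupling(p, q):
    """Joint matrix of a maximal coupling of two pmfs on the same finite set:
    mass min(p_i, q_i) on the diagonal, the remainder spread as a product."""
    n = len(p)
    overlap = [min(a, b) for a, b in zip(p, q)]
    tv = 1.0 - sum(overlap)
    joint = [[0.0] * n for _ in range(n)]
    for i in range(n):
        joint[i][i] = overlap[i]
        for j in range(n):
            if tv > 1e-15:
                joint[i][j] += (p[i] - overlap[i]) * (q[j] - overlap[j]) / tv
    return joint


def restrict(nu, keep):
    """Return (nu(keep), conditional pmf of nu given keep) as a full-length list."""
    mass = sum(nu[x] for x in keep)
    cond = [nu[x] / mass if x in keep else 0.0 for x in range(len(nu))]
    return mass, cond


def semi_independent_joint(nu, nu_t, A):
    """Exact joint law of (X, X~) under the A-semi-independent coupling."""
    m = len(nu)
    A = set(A)
    Ac = set(range(m)) - A
    a, nu_A = restrict(nu, A)
    a_t, nu_t_A = restrict(nu_t, A)
    _, nu_Ac = restrict(nu, Ac)
    _, nu_t_Ac = restrict(nu_t, Ac)
    lo, hi = min(a, a_t), max(a, a_t)
    # U <= lo: best coupling of the two conditionals on A.
    best = best_coupling(nu_A, nu_t_A) if lo > 0 else [[0.0] * m for _ in range(m)]
    # lo < U <= hi: the side with larger A-mass draws from A, the other from A^c.
    x_mid = nu_A if a > a_t else nu_Ac
    xt_mid = nu_t_A if a_t > a else nu_t_Ac
    # U > hi: both draw independently from A^c.
    return [[lo * best[i][j]
             + (hi - lo) * x_mid[i] * xt_mid[j]
             + (1.0 - hi) * nu_Ac[i] * nu_t_Ac[j]
             for j in range(m)] for i in range(m)]


# Hypothetical example on four states with A = {0, 1}.
nu = [0.1, 0.2, 0.3, 0.4]
nu_t = [0.25, 0.25, 0.25, 0.25]
joint = semi_independent_joint(nu, nu_t, {0, 1})
row_marginal = [sum(row) for row in joint]
col_marginal = [sum(joint[i][j] for i in range(4)) for j in range(4)]
```

Taking `A` to be the whole state space reduces the construction to the best coupling, and taking `A` empty gives the independent coupling, matching the two boundary cases noted above.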

Proposition 4.2. The following holds for the A- semi-independent coupling of {v^v): 

(1) X, X are distributed according to v, v respectively. 

(2) P* (u,eA{X = x}A{X = x}) ^ I l^(^) - 

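(3) $\forall x \notin A$, $\mathbb{P}^*(X = x, \tilde X \ne x) \ge \nu(x)\tilde\nu(A^c \setminus \{x\})$ and
$\forall x \notin A$, $\mathbb{P}^*(\tilde X = x, X \ne x) \ge \tilde\nu(x)\nu(A^c \setminus \{x\})$.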



Proof. Part one of the Lemma is immediate. Part (2) follows from a straight forward 
calculation: 



yj^^A{X = x}A{X = x}j ^ {U ^v{A) Au{A))F* [X X\U ^v{A) AV{A)j + 

{v{A) A V{A) <U < u{A) V 1^{A)) 
^ (j^iA) A V{A))l |(z.U(x) - V\Aix))\ + HA) - ^A)\ 

As for part (3), we have:

$$\mathbb{P}^*(\{X = x\} \setminus \{\tilde X = x\}) \ge \mathbb{P}^*(U > \nu(A) \vee \tilde\nu(A))\, \nu|_{A^c}(x)\big(1 - \tilde\nu|_{A^c}(x)\big) \ge \nu(x)\tilde\nu(A^c \setminus \{x\})$$

and similarly for $\mathbb{P}^*(\tilde X = x, X \ne x)$. □



We are now ready to define the coupling of $\sigma_t, \tilde\sigma_t$ for this section. Fix $\sigma_0, \tilde\sigma_0 \in \Sigma_n$ and $y_1, \ldots, y_{q-1} > 0$. The coordinate-wise coupling with parameters $y_1, \ldots, y_{q-1}$ and starting configurations $\sigma_0, \tilde\sigma_0$ is defined as follows.

(1) Set r(o) = 0, = 1. 

(2) As long as A; ^ g — 1: 



(a) As long as St - St 



> 



Vk. 



(i) Draw It+i, It+i, using a {1, . . 

St- 
ill) Draw Jt+i, Jt+i, using a {1, 



, k — l}-semi-independent coupling of St, 
. . ,k — l}-semi-independent coupling of 



{St - Ji^It+i)^ dp [S^ 



-er 

n h+i 



(iii) Change a uniformly chosen vertex of color I^+i in at to have color Jt+i 
in at+i. 

(iv) Change a uniformly chosen vertex of color I^+i in at to have color Jt+i 
in at+i. 



(v) Set t = i 



(b) When 



S^ - St' 



1. 



^ a set tC') = t - T^'-^'^ &ndk = k + l. 
(3) Set T^^ = YXX'^^^^- 
We shall use to denote the probability measure for this coupling and p*^*^!™^ for the 

same coupling, only with k = m instead of = 1 in step ( I ), i.e. starting from the m-th 
stage. Notice that, in principle, the stopping condition at stage k, may never get satisfied, 
in which case we stay at that stage forever and T^'^^ = T'^^ = oo. 
For n, r > 0, define 



ft. 



{a, a) G S„ X S„ : ||s[^'''l < -, max - e||2, ||s - ejla) < 

n 



where above (s,s) = {S{a),S{a)). Finally, set T-Lu^r — 'Hu,r- The following lemma will be 
the main ingredient in an inductive proof for an upper bound on T*^*^: 

Lemma 4.3. Fix (3 < q/2. Let k G — 1]. For all nfc_i,rfc_i,e > 0, there exist 
yk,Uk,rk,^k > 0, such that if {ao,ao) G 'Htk.-^,- - ^^^'^ 



,rk-i 



Proof. Recall the expression for the drift of one coordinate 
Taylor expansion for any i £ [l,q]- 



(4.5) 

I ). Near e, this becomes by 



1 



1 



Sl + \\s 



)} 



+ 0(n- 



and if Wt = St — St then 

t;:,ff[Wt'+i-wt'\st,st] = ^{- 



l_^\w,^ + 0{\\St 



m + \\st 



(4.6) 



Now, for some > to be chosen later, let t^'^ = inf |t : ||5( — e||2 V \\St — e||2 ^ :^}- 



Then, from (4.6) it follows that there exists yk > such that Wt 



IS a 
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supermartingale. Clearly 



I, t. 



< |. Also, from Proposition 4.2, if t < t^^) /\T^'^): 



Ft) ^ P2.f = Jt+i + k, It+i / k 



which implies that '¥p'^^'^ 



t 1. 



+ O n 2 



hand, in view of ( 1.6), IE ~ 



1. t 



^ 55^2^-2 _^ Q (^n-^/2y On the other 
= 0(n^'^/^). Combining the two bounds. 



we infer that there exists C > 0, which doesn't depend on r/. or y^, such that on 

~" ^Cn-^ (4.7) 



(Tn.crn V 1^+-'- 



for n sufficiently large. We now apply Lemma 2.3 with Zt = , zq 
T-W A T^'^). This gives for 7^ > 0: 



2rfc_ 



i and AT 



whence we may choose 7^ = jki^k-i, e) independently of r^, but sufficiently large, such 
that 

(4.8) 



jcc(fc) Mfc) ^^(fc) 



This gives an upper bound on T^^^ since by Proposition 3.3 Part (2) we may choose 
large enough such that: 



(4.9) 



It remains to ensure that we do not increase the distances in the first k — 1 coordinates, 
by too much. Proposition 4.2 implies that for any t: 



cro,0"o \ ^ I ' 



^ o E (r;i + b^(5*)-5^(5t)l)+0(n-i 



/ sg A;-l 



^ C\\W^^''-\+rlO{n-^). 



It follows that 

jgCC(fc) 



t+i I 



1^1 



1 + 



n 



iTrr«ii.(i+^)+.jo(„-). 
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Taking expectation of both sides and using the assumption on \\Wq ' ||i, we have 



n / n 



Hence by Markov's inequahty, there exists > such that 

pCC(fc) / ,,-^[i,k-i].. uk-yk \ e 
^.0,^0 [W^^.n 111 > ^ j ^ 3- 

Combined with ( ' ) and (4.9), the proof is complete. □ 
Corollary 4.4. Fix /? < q/2. For any e,r > 0, there exist 7, u, r' > and yi, ■ ■ ■ yq-i > 

r 

such that for (Tq, ctq G • 

Proof. Starting from rQ = uq = r and applying Lemma ^.-j inductively, we obtain for some 

{yk,Uk,rk,-fk)k(i[l,q-l]- 

pSo(^^'^ < 7^™, (^T(^)>?T(^)) e VA; G - 1]) ^ 1 - 6. 

It remains to set 7 = Yl'k=i Ik-, f' = fq-i and u = 2iiq_i. □ 

4.4. Coalescence of Proportions Vector Chains. The next lemma completes the 
coupling of the proportions chains. 

Lemma 4.5. Fix (3 < q/2. For all r, u, e > there exists 7 > such that if ao,ao G S„ 
satisfy {ao,ao) G 'Hu,j- and t ^ jn, then 

where under F^^~^^, the processes {crt)t^o, (p't)t^o evolve according to the synchronized 
coupling, as defined in Subsection 

Proof. If p is small enough, it follows from Lemma that 

rrn rrn II 1 1 ^ ^ I 



2n y n 

Combined with Proposition 3.3 Part (1), this implies that there exists 7 = j{u) such that 
lE^'^- — ^ ^ for t ^ 7n. Then by Markov's Inequality: 



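4.5. Basket-wise Proportions Coalescence. The next coupling will allow us to turn a well-mixed proportions chain into a well-mixed configurations chain. Let $\mathcal{B} = (B_m)_{m=1}^{q}$ be a partition of $[1, n]$. We shall refer to $B_m$ as a basket and call $\mathcal{B}$ a $\lambda$-partition if $|B_m| > \lambda n$ for all $m$. Given $\sigma \in \Sigma_n$, let $\mathbf{S}(\sigma)$ denote a $q \times q$ matrix whose $(m, k)$ entry is equal to the proportion in $\sigma$ of color $k$ in basket $m$, namely

$$\mathbf{S}^{m,k}(\sigma) = \frac{1}{|B_m|} \sum_{v \in B_m} \mathbf{1}_{\{\sigma(v) = k\}}.$$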

S is an element of S = 11^=1 ^ and we define SP = l\'L=i and = nm=i 5''+. We 
also let i3[mo,mi] = Umo^msgmi'Sm and as before use Sj as a shorthand for S((Tt). The 
following is an analogue of Lemma ' . ! for the basket proportions matrix. 
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Lemma 4.6. Let ^ be a X-partition for some A > 0. // either of the following holds: 

(1) fJo G S^" and t"i(n) < t ^ e''(''o)" where po, -yipo) are given in Proposition o. i and 
t"^{n) is defined in (4.1). 

'•Q 

(2) So € and t ^ jqu for some ro,7o > 0, 
then 



O"0 



0{r- 



where the 0{r ^) term is as r ^ oo, uniformly in n. 

In order to prove Lemma 4.6, we use the following proposition to bound the second 
moment of the basket proportions matrix. 



Proposition 4.7. is a X-partition for A > and m,k £ [^,q], then 



E 



Proof. Let Aq = \Bm\/n ^ A > and set Q 



m,k 
t 



S^f + o 



S^. Then: 



(4.10) 



E 



Pi((Q 



m.k , 1 \2 



+ 



(Qr'^)2)+P2((Qr-i)^-(Q^ 
,^ + ^)'-(Qr')')+P4((Qr'^ 

(pi-P2)+(l-i^) (p3-P4))+0(^) , 



vm,fc 1 \2 



P3((Qr'' 



9 

n 



m,k\2\ 



+ 



(Q 



m,k\2\ 



(4.11) 



where (denoting by Vt+i the chosen vertex at step t + 1): 

PI = P(^t+i i Bm, at{Vt+i) = k, at+iiVt+i) / k\Tt) = (Sj - AoSr'')(l - g'fsiSt)) + 0{n-'), 
P2 = nVt+i i B^, at{Vt+i) / k, at+i(T4+i) = k\Ft) = (1 - - Aq + XoST'^)g%St) + O(n-i), 
P3 = IP(V^+i G at(yt+i) = k, at+i{Vt+i) / A:| Ji) = AqS^'^I - 5^(5t) + ©(n"^), 
P4 = P(^t+i G at{Vt+i) / A;, fTt+i(Vt+i) = k\Ft) = Ao(l - Sr''^)<7^(S0 + 0{n-^). 
Plugging these into (4.11), we obtain 

mK-A - sir? I H = (1 - - S^f + O(^) 

as required. 

Proof of Lemma 4-6. Taking expectation in (4.10) and applying ( ' ".') one gets: 



□ 



gm,fc _ gk 



^ 1 



■^0 '-'o 



+ 0(n-i). 



In both cases (note that Qi(/3,g) > 4 for all (3 < q/2), it implies Eq- 
0(n~^). Summing over all m and A; and using Markov's Inequality we get 



gm,fc _ gk 



\\ST-St\U>r/{2V^) I =0(r-^). 

1 m q 

Now, by Proposition 3.7 Case (I) and Proposition Case ( ') we also have that 

P,o(||5t-e||, >r/(2V^))=0(r-2). 



27 



Combining the two, we complete the proof. □ 

Suppose now that we have two initial configurations $\sigma_0, \tilde\sigma_0$ such that $s_0 = \tilde s_0$. The following is a coupling under which eventually (with probability 1) also $\mathbf{S}_t = \tilde{\mathbf{S}}_t$. Equality is achieved one basket at a time, indexed below by $m$, and once the proportions in a basket are equated they remain so. We shall call this coupling Basket-wise Coupling and denote by $\mathbb{P}^{BC}$ the underlying probability measure.

(1) Set $t = 0$, $m = 1$.

(2) As long as $m \le q$:

(a) As long as $\mathbf{S}_t^m \ne \tilde{\mathbf{S}}_t^m$:

(i) Choose "old" color $I_{t+1}$ according to distribution $S_t = \tilde S_t$.

(ii) Choose "new" color $J_{t+1}$ according to distribution $g_\beta(S_t - \frac{1}{n} e_{I_{t+1}}) = g_\beta(\tilde S_t - \frac{1}{n} e_{I_{t+1}})$.

(iii) Choose a vertex $V_{t+1}$ uniformly among all vertices in $V$ having color $I_{t+1}$ under $\sigma_t$.

(iv) Choose $\tilde V_{t+1}$:

(A) If $V_{t+1} \in B_{m_0}$ for $m_0 < m$, choose $\tilde V_{t+1}$ uniformly among all vertices in $B_{m_0}$ having color $I_{t+1}$ under $\tilde\sigma_t$.

(B) Otherwise, if $\mathbf{S}_t^{m, I_{t+1}} \ne \tilde{\mathbf{S}}_t^{m, I_{t+1}}$ and $\mathbf{S}_t^{m, J_{t+1}} \ne \tilde{\mathbf{S}}_t^{m, J_{t+1}}$, choose $\tilde V_{t+1}$ uniformly among all vertices in $B_{[m,q]}$ having color $I_{t+1}$ under $\tilde\sigma_t$.

(C) Otherwise, let $v_1, v_2, \ldots$ be an enumeration of the vertices in $B_{[m,q]}$ having color $I_{t+1}$ under $\sigma_t$, ordered first by the index of the basket they belong to and then by their index in $V$, and let $\tilde v_1, \tilde v_2, \ldots$ be the same for $\tilde\sigma_t$. Then set $\tilde V_{t+1} = \tilde v_i$ where $i$ is such that $V_{t+1} = v_i$.

(v) Set $\sigma_{t+1}(V_{t+1}) = \tilde\sigma_{t+1}(\tilde V_{t+1}) = J_{t+1}$ and $t = t + 1$.

(b) Set $m = m + 1$.

The following lemma gives an upper bound for the time of basket-wise proportions 
coalescence. 

Lemma 4.8. Fix f3 < q/2. For any A > 0, r > 0, e > 0, there exists 7 = j{X,r,e), such 
that for any X-partition and any ao,ao such that Sq = Sq and Sq, Sq E S^, 

<'>o(S7n = V) (4.12) 

Proof. From the definition of the coupling, once the proportions of basket m have coalesced 
they will remain equal forever. It suffices, therefore, to analyze the coalescence time of 
each basket separately. Note also that the coupling preserves the equality St = St for all 
t^O. 

Define = 8* - 8*, = ||WJ"||i and let r(°) = and r^*^) = min{t ^ r("^i) : 
= 0} for m G [1, g]. Also set 

n = inf{t : St^SP or St ^ S^} 

for p > sufficiently small and ri™"* = r^™) A r*. We claim that for all m, (VF™)t ^ is 
a supermartingale between r^™"^) and t(™) as long as is not reached. In order to see 
this, fix m, t and assume t}. Then at step (2(a)iv), according 

to the coupling, there are 3 cases: 

(A) Clearly 8™ ^ = 8^ and S^^ = 8^ and hence W^^^ = W^. 
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(B) Notice that in this case, we have 
Therefore, 



mJt- 



^W;^'[*+^ ^ and wr'^'+^W,"I'f*+i ^ 0. 



t+i 



t+1 



wr\H 



|W 



t+l I 

mjt+i I 
t I 



W 



m,It+i I 



t+i 



mo ^ m t 



+ 



|W 



BC 

m,It+i I 
t I 



w 



rn,Jt+i I 



mo I 



mo ^ m t ' 



mo I 



(C) If V^+i,yt+i G i3™ or ^ then t^^^ 

struction we must have: 



W7", otherwise from the con- 



I** t+l I 



|W 



1 

\Bm\ ' 



as weU as 



|^m,Ji+i| _ |^m,Jt+i| 



t+l I I "i I \Bm\ ■ 

Summing these two, we obtain a non-positive drift for W^. 

Observe that as long as t^, is not reached, both Ya.i^'-' {W^i\Tt) under case (B) and the 

probabihty that this case happens are bounded below uniformly in n and t. This gives a 

uniform lower bound on the variance Yar^'-^ {Wl^i\J-t) ■ Furthermore, if for some t, we have 

~ -111. 
St, St G Sv^, then in view of Lemma after 7'n time, we have St+yn, St+yn G 

with probability 1 — 0((r")~^). Therefore, using Lemma 2.3 we may find 71, ... , 7g_i such 
that inductively, conditioned on ri"* ^ 7^-1?^ with probability at least 1 — e/{2q) we 
have ri*"^ ^ 7m^^• This in turn implies that ri'^ ^ 771 with probability at least 1 — e/2, 
where 7 = 7g-.i. 

It remains to bound t^, below with high probability. Let B"^'^ 
and 



{t: 



l/q\ ^ p/2, 1 ^ t ^ 7n} 



Using Lemma l.ti we obtain that 



<So[^"'']^7r^O(^) = 0(7) 



Then as B"^'^ imphes that F™'-'' > 



nXp 



r^ApX ^ 2Efo'^o[^'"''] 



nXp 



0{n 



-1^ 



Summing over all m, j and arguing the same for St we obtain 

IPfo^5o(^*<7^) = 0(^"'), 

Finally by a union bound we have Pf^- (r^?) ^ 7n) ^ 1 - § + O(n-i) as desired. □ 

4.6. The Overall Coupling. We now describe precisely how the previous couplings are 
combined together to create the overall coupling. This coupling will be the main tool in 
proving the upper bound. Formally, let $\gamma_1, \gamma_3, \gamma_4, \gamma_5$ and $y_1, \dots, y_{q-1}$ be positive numbers
and $\sigma_0 \in \Sigma_n$. The overall coupling with parameters $\gamma_1, \dots, \gamma_5$, $y_1, \dots, y_{q-1}$ and initial
configuration $\sigma_0$ is a coupling of two chains $(\sigma_t)_t$, $(\tilde\sigma_t)_t$ under measure $\mathbb{P}^{OC}_{\sigma_0}$. The initial
configuration for $(\sigma_t)_t$ is $\sigma_0$, while $\tilde\sigma_0$ is chosen according to $\mu_n$. Then, the two processes
evolve as follows.

(1) Run $\sigma_t$ and $\tilde\sigma_t$ independently until time $t^{(1)}(n) = \gamma_1 n$.

(1A) Partition the vertex set $[1,n]$ into baskets $\mathcal{B} = (B_1, \dots, B_q)$ such that $B_k = \{v : \sigma_{t^{(1)}(n)}(v) = k\}$ for $k \in [1,q]$.

(2) Run $\sigma_t$ and $\tilde\sigma_t$ independently (again) until time $t^{(2)}(n) = t^{(1)}(n) + t^{\alpha_1}(n)$, where
$t^{\alpha_1}(n)$ is defined in (11).

(3) Run $\sigma_t$ and $\tilde\sigma_t$ according to the coordinate-wise coupling with parameters $y_1, \dots, y_{q-1}$
until time $t^{(3)}(n) = t^{(2)}(n) + \gamma_3 n$ (unless stopped before).

(4) Run $\sigma_t$ and $\tilde\sigma_t$ according to the synchronized coupling until time $t^{(4)}(n) = t^{(3)}(n) + \gamma_4 n$.

(5) Run $\sigma_t$ and $\tilde\sigma_t$ according to the basket-wise coupling, with the baskets above, until time
$t^{(5)}(n) = t^{(4)}(n) + \gamma_5 n$.



4.7. Proof of Upper Bound in Theorem ; . We will now use the overall coupling 
with appropriate parameters to establish the upper bound of the mixing time. Recall that 
$\beta < \beta_s(q)$. Fix $\epsilon > 0$, pick $\rho > 0$ small enough and let $\sigma_0$ be any initial configuration. By
Proposition •J.-j Part (•.>), we can choose 71 large enough such that 



■ 0-0 



> 1 



e . 



(4.13) 



Assuming that this event indeed occurred, 5S is a (^ — /9)-partition and provided that p 

are satisfied. From the latter we conclude 



is small enough, the conditions in Lemma 
that for some r > 0, with probability at least 1 — e, 'S'j(2)(„) 



G 5 ^ . On the other hand, as 



in (4.4) with probability at least 1 — 2e we also have 5'j(2)(„) £ if r is large enough. 
Then Corollary 4.4 and Lemma 4.5 ensure that there exist yi, . . .yq-i and 73, 74, such 
that 5*^(4) („) = 5'^(4)(„) with probability at least 1 — 3e. From Lemma we have that 

Sj(4)(„), S^(4)(„) G with probability at least 1 — 4e for some r' > 0. Then, by Lemma 
4.8 we may choose 75 such that S^(5)(„) = 8^(5) with probability at least 1 — 5e. 

Now, by symmetry, for any t ^ t^^\n) the distribution of at, given J'tW(^n)^ invariant 
under permutations of the vertices in each basket of B and the same is clearly true for 
Therefore we conclude that 



3,0c 

" CO 



»t{5)(n) 



-li 



7^ St{5)(n) ^tW{n)^ ^tW{n) ^ '^^ 

Then from Jensen's inequality we obtain 



^ 5e. 



^ 5e + e 



oc 
6e. 



G • 



^ ((i)(n) 



/^n 1 1 TV 



+ 



dOC 



(4.14) 

t(i)(n) ^ , 



Now t^^\n) = t"i(n) (as defined in ( ')) with 7 = 71 + 73 + 74 + 75 and since ctq is 
arbitrary and e can be made arbitrarily small, by choosing 7 large enough, this establishes 
the upper bound for the cutoff. □ 
5. Mixing in the Supercritical Regime

5.1. Proof of Theorem . We first give the proof for the case $\beta_s < \beta < \beta_c$. Recall
(Subsection 2.2) that $\beta_c(q) = \frac{q-1}{q-2}\log(q-1)$ for $q \ge 3$ and $\beta_c(2) = 1$. We claim that for
$q \ge 3$

$\beta_c(q)(1 - 1/q) < \beta_c(q-1)$. (5.1)

It can be checked for $q = 3$ and for $q \ge 4$, it suffices to prove that $f(x) := \frac{x-1}{x(x-2)}\log(x-1)$
is decreasing in $x$ on $[3, \infty)$. We compute the derivative and obtain that

$f'(x) = -\frac{1}{x(x-2)}\left(\frac{x^2 - 2x + 2}{x(x-2)}\log(x-1) - 1\right)$,

which is negative for $x \ge 3$.
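
As a numerical sanity check of (5.1), the following sketch evaluates both sides for a range of q. It assumes the closed form $\beta_c(q) = \frac{q-1}{q-2}\log(q-1)$ for $q \ge 3$ with $\beta_c(2) = 1$, and takes $f(x) = \beta_c(x)/x$, which is our reading of the function in the monotonicity argument. The helper names `beta_c`, `f`, `check_inequality` and `is_decreasing` are illustrative only, and a finite check is no substitute for the derivative computation.

```python
import math

def beta_c(q: int) -> float:
    """Critical inverse temperature (assumed closed form, see lead-in)."""
    if q == 2:
        return 1.0
    return (q - 1) / (q - 2) * math.log(q - 1)

def f(x: float) -> float:
    """f(x) = beta_c(x) / x, assumed to be the function shown decreasing on [3, inf)."""
    return (x - 1) / (x * (x - 2)) * math.log(x - 1)

def check_inequality(q_max: int = 200) -> bool:
    """Check beta_c(q) * (1 - 1/q) < beta_c(q - 1) for all 3 <= q <= q_max."""
    return all(beta_c(q) * (1 - 1 / q) < beta_c(q - 1) for q in range(3, q_max + 1))

def is_decreasing(xs) -> bool:
    """Check that f is strictly decreasing along the increasing grid xs."""
    values = [f(x) for x in xs]
    return all(a > b for a, b in zip(values, values[1:]))

if __name__ == "__main__":
    grid = [3 + 0.01 * k for k in range(5000)]
    print(check_inequality(), is_decreasing(grid))
```

Since $f(q) < f(q-1)$ is the same statement as $\beta_c(q)(q-1)/q < \beta_c(q-1)$, the two checks test the same inequality from two directions.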

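Now fix $\delta > 0$ and notice that if $S^1 \in [1/q, 1-\delta]$, conditional on $\{S^1 = s^1\}$, $(S^i/(1 - s^1) : 2 \le i \le q)$
is distributed as the proportions vector for the $(q-1)$-states Curie-Weiss Potts
model on $(1 - s^1)n$ vertices with $\beta' = \beta(1 - s^1) \le (1 - 1/q)\beta \le (1 - 1/q)\beta_c(q) < \beta_c(q-1)$.
Therefore, for all $\delta_1 > 0$

$\mu_n\left(\exists\, 2 \le i \le q \text{ such that } \left|S^i - \tfrac{1 - s^1}{q-1}\right| \ge \delta_1 \,\middle|\, S^1 = s^1\right) \to 0$ (5.2)

as $n \to \infty$ uniformly in $s^1 \in [1/q, 1 - \delta]$. Also uniformly in $s \in \mathcal{S}_n$, recall that:

$\mathbb{E}[S^1_{t+1} - S^1_t \mid S_t = s] = \frac{1}{n} d^1_\beta(s) + O(n^{-2})$,

where $d^1_\beta(s) = -s^1 + g^1_\beta(s)$. It now follows from the uniform continuity of $d^1_\beta(s)$ and (5.2)
that uniformly in $s^1 \in [1/q, 1 - \delta]$,

$\mathbb{E}_{\mu_n}[S^1_{t+1} - S^1_t \mid S^1_t = s^1] = \frac{1}{n}\left(D_\beta(s^1) + o(1)\right)$,

where $D_\beta(s^1) = d^1_\beta\left(s^1, \tfrac{1-s^1}{q-1}, \dots, \tfrac{1-s^1}{q-1}\right)$.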
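
The drift formula above can be illustrated numerically. The sketch below compares $n$ times the exact one-step drift of the first proportion with the limiting drift $d^1_\beta(s)$. Two assumptions are made here: $g^k_\beta(s)$ is taken proportional to $e^{2\beta s^k}$, and a Glauber update is assumed to pick a uniform vertex and recolor it with probabilities proportional to $\exp\big((2\beta/n)\,\#\{v \ne u : \sigma(v) = k\}\big)$. The function names are hypothetical.

```python
import math

def g(beta, s):
    """g_beta(s): probabilities proportional to exp(2 * beta * s_k) (assumed form)."""
    weights = [math.exp(2 * beta * x) for x in s]
    total = sum(weights)
    return [w / total for w in weights]

def d1(beta, s):
    """First coordinate of the limiting drift d_beta(s) = -s + g_beta(s)."""
    return -s[0] + g(beta, s)[0]

def D(beta, q, s1):
    """D_beta(s1): d^1_beta with the remaining mass split evenly over q - 1 colors."""
    rest = (1 - s1) / (q - 1)
    return d1(beta, [s1] + [rest] * (q - 1))

def exact_drift(beta, counts):
    """n * E[S^1_{t+1} - S^1_t | S_t] for one update under the assumed rule.

    counts[k] is the number of vertices of color k + 1; the updated vertex
    is excluded from the counts that determine its new color.
    """
    n = sum(counts)
    prob_new_is_first = 0.0
    for old, c_old in enumerate(counts):
        if c_old == 0:
            continue
        others = list(counts)
        others[old] -= 1  # exclude the updated vertex itself
        prob_new_is_first += (c_old / n) * g(beta, [c / n for c in others])[0]
    # S^1 moves by +1/n when the new color is 1 and the old is not, and by
    # -1/n in the opposite case, so n * drift = P(new = 1) - P(old = 1).
    return prob_new_is_first - counts[0] / n
```

For instance, `exact_drift(1.2, [500, 300, 200])` and `d1(1.2, [0.5, 0.3, 0.2])` differ by a term of order $1/n$, consistent with the $O(n^{-2})$ error after dividing by $n$.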

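Now if $\beta > \beta_s(q)$, from Proposition 3.1, there exists $\delta_2$ such that $D_\beta(s^1)$ is uniformly
positive in a $\delta_2$-neighborhood of $s^*(\beta)$. All together we infer that there exists $\epsilon > 0$ such
that uniformly in $s^1 \in (s^*(\beta) - \delta_2, s^*(\beta) + \delta_2)$ for all $n$ large enough:

$\mathbb{E}_{\mu_n}[S^1_{t+1} - S^1_t \mid S^1_t = s^1] \ge \epsilon/n$,

and also

$\mathbb{P}_{\mu_n}\left(S^1_{t+1} = s^1 + \tfrac{j}{n} \,\middle|\, S^1_t = s^1\right) - \mathbb{P}_{\mu_n}\left(S^1_{t+1} = s^1 + \tfrac{1}{n} + \tfrac{j}{n} \,\middle|\, S^1_t = s^1 + \tfrac{1}{n}\right) = o(1)$,

for $j \in \{-1, 0, 1\}$, where the last estimate follows from the concentration of the conditioned
measure as well as the continuity of the probability to stay put. These two formulas
together imply that

$\mathbb{P}_{\mu_n}\left(S^1_{t+1} = s^1 + \tfrac{1}{n} \,\middle|\, S^1_t = s^1\right) \ge \lambda\, \mathbb{P}_{\mu_n}\left(S^1_{t+1} = s^1 \,\middle|\, S^1_t = s^1 + \tfrac{1}{n}\right)$,

for some fixed constant $\lambda > 1$ for all $s^1 \in (s^*(\beta) - \delta_2, s^*(\beta) + \delta_2)$ when $n$ is sufficiently
large. Since $(S_t)_{t \ge 0}$ is a reversible Markov chain, with $\mu_n$ its stationary measure,

$\mathbb{P}_{\mu_n}\left(S^1_{t+1} = s^1 + \tfrac{1}{n},\ S^1_t = s^1\right) = \mathbb{P}_{\mu_n}\left(S^1_{t+1} = s^1,\ S^1_t = s^1 + \tfrac{1}{n}\right)$,

and therefore, for all $s \in (s^*(\beta) - \delta_2, s^*(\beta) + \delta_2)$

$\mu_n\left(S^1 = s + \tfrac{1}{n}\right) \ge \lambda\, \mu_n(S^1 = s)$

and hence

$\mu_n(S^1 = s^*(\beta) + \delta_2) \ge \lambda^{2\delta_2 n}\, \mu_n(S^1 = s^*(\beta) - \delta_2)$. (5.3)

Now select the set $A = \{S^1 \ge s^*(\beta) - \delta_2\}$. By ( ), $\mu_n(\partial_{P_n} A)/\mu_n(A) \le \lambda^{-2\delta_2 n}$, where

$\partial_{P_n} A = \{x \in A : P_n(x,y) > 0 \text{ for some } y \notin A\}$

and $P_n$ is the transition kernel of the Glauber dynamics. Since $\beta < \beta_c(q)$ and $s^*(\beta) - \delta_2 > 1/q$
we also have $\mu_n(A) = o(1)$ as $n \to \infty$. Therefore Cheeger's inequality (Theorem 2.5)
immediately implies an exponential lower bound on the mixing time.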

The case $\beta \ge \beta_c(q)$ is simpler. As the large deviations analysis in Subsection 2.2 shows,
we may find $A = \{\|S - s_{\beta,q}\|_2 < \delta\}$, where $s_{\beta,q}$ is defined in (2.4) and $\delta > 0$ is small
enough such that $\limsup_{n\to\infty} n^{-1}\log\mu_n(\partial_{P_n} A) < 0$ and $\liminf_{n\to\infty} n^{-1}\log\mu_n(A) = 0$.
Since symmetry implies $\mu_n(A) \le 1/q$ (if $\delta$ is sufficiently small), exponential mixing time
follows immediately from another application of Cheeger's inequality (Theorem 2.5). □

6. Mixing Near Criticality 

We now assume $\beta(n) = \beta_s(q) - \xi(n)$, with $\xi(n) \to 0$ as $n \to \infty$. Once $\beta(n)$ approaches
$\beta_s$ with $n$, we no longer have a uniform negative upper bound on the drift to the right
of $1/q$ for each coordinate. Instead, near $s^*(\beta)$, the drift will be of order $\xi(n)$, possibly
even positive, and hence it will take longer than linear time to get close to e. This
may have an effect on the order of the mixing time and cutoff window. Accordingly, in
addition to the coalescence time analysis near e, one has to obtain sharp asymptotics for
the passage time near $s^*(\beta)$. This is achieved using several propositions which we state in
Subsection 6.1. Their proofs will be deferred until the end of the section in favor of first
showing how they are used along with the previous coalescence analysis to find the mixing
time near criticality, which gives the proof of Theorem 2.

Both the analysis and the results in Theorem 2 are qualitatively different, depending
on whether $\xi(n)$ decays faster or slower than some threshold rate. Accordingly, we shall
distinguish between two regimes and write:

$\xi \in$ [CR] if $\lim_{n\to\infty} n^{2/3}\xi(n) = \infty$, $\xi(n) = o(1)$.
$\xi \in$ [NCR] if $0 \le \liminf_{n\to\infty} n^{2/3}\xi(n) \le \limsup_{n\to\infty} n^{2/3}\xi(n) < \infty$. (6.1)

([CR] stands for Cutoff Regime and [NCR] stands for No-Cutoff Regime). For $\alpha > 0$,
define also

= ! *7fe+^(w""") ««^>^'" (6,2) 

i eTn4/3 if ^ G [NCR]. 

Both ( ) and (G.2) will be used for sequences other than ^ as well. We shall also employ 
the following notation for hitting times. Given a real-valued process $(X_t)_{t \ge 0}$ and a number
$x \in \mathbb{R}$ we shall write

$\tau_x^+ = \inf\{t : X_t \ge x\}$ and $\tau_x^- = \inf\{t : X_t \le x\}$

for the right and left hitting time of $X$ at $x$. Notice that this notation does not carry an
indication for the process for which $\tau$ is a hitting time and in case this is not clear from
the context, it will be mentioned explicitly.
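
For a finite sample path the two hitting times can be computed directly. The following is a minimal sketch, assuming the path is given as a sequence indexed by $t = 0, 1, 2, \dots$ and returning infinity when the level is never reached (the convention $\inf \emptyset = \infty$). The function names are ours.

```python
import math
from typing import Sequence

def right_hitting_time(path: Sequence[float], x: float) -> float:
    """First index t with path[t] >= x, or math.inf if there is none."""
    for t, value in enumerate(path):
        if value >= x:
            return t
    return math.inf

def left_hitting_time(path: Sequence[float], x: float) -> float:
    """First index t with path[t] <= x, or math.inf if there is none."""
    for t, value in enumerate(path):
        if value <= x:
            return t
    return math.inf
```

As in the text, the functions carry no reference to the process itself: the caller decides which path (for example one coordinate of the proportions chain) is passed in.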
6.1. Drift Analysis Near $s^*(\beta)$. The following proposition states several properties of
the function $D_\beta$ near $s^*(\beta)$.

Proposition 6.1. For all $q \ge 3$ the following holds:

(1) The point $s^*(\beta_s)$ is the unique $s \in (1/q, 1]$ such that $D_{\beta_s}(s) = 0$.

(2) For $k = 0, 1, \dots$, the functions $D^*_k(\beta) = \frac{\partial^k}{\partial s^k} D_\beta(s^*(\beta))$ are $C^\infty$ in a neighborhood
of $\beta_s$. Furthermore:

• $\frac{d}{d\beta} D^*_0(\beta_s) > 0$.

• $D^*_2(\beta_s) < 0$.

(3) For all $\rho > 0$, there exists $\delta > 0$ such that:

$\sup\{D_\beta(s) : s \in (1/q + \rho, 1],\ |s - s^*(\beta)| > \rho,\ |\beta - \beta_s| < \delta\} < 0$. (6.3)
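
The spinodal point can be located numerically from the sign of the one-coordinate drift. The sketch below assumes the closed form $D_\beta(s) = -s + \big(1 + (q-1)e^{2\beta(1-qs)/(q-1)}\big)^{-1}$, which is our reading of the drift function used in this section, and uses the fact that $D_\beta(s)$ is increasing in $\beta$ for $s > 1/q$ to bisect for the smallest $\beta$ at which the maximum of $D_\beta$ over a grid in $(1/q, 1]$ reaches zero. Grid size, margin and tolerance are arbitrary illustrative choices.

```python
import math

def D(beta, q, s):
    """Assumed closed form of the one-coordinate drift D_beta(s)."""
    return -s + 1.0 / (1.0 + (q - 1) * math.exp(2 * beta * (1 - q * s) / (q - 1)))

def max_drift(beta, q, grid_size=4000, margin=1e-3):
    """Maximum of D_beta over a uniform grid in [1/q + margin, 1]."""
    lo = 1.0 / q + margin
    step = (1.0 - lo) / grid_size
    return max(D(beta, q, lo + k * step) for k in range(grid_size + 1))

def spinodal_beta(q, tol=1e-8):
    """Bisection for the smallest beta in [0, q/2] with max_drift(beta, q) >= 0."""
    lo, hi = 0.0, q / 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if max_drift(mid, q) < 0:
            lo = mid  # drift still strictly negative: below the spinodal point
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

The returned value should be read as a grid approximation of $\beta_s(q)$; it lies below both $q/2$ and the closed form for $\beta_c(q)$, in line with $\beta_s < \beta_c$ and $\beta_s < q/2$.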

The next lemma gives sharp asymptotics for the passage time near 0 for a process with
certain drift assumptions near 0 (given by (6.4) below). The one coordinate process will
fall into this category if we analyze it near $s^*(\beta)$.

Formally, let $((Z^{(n)}_t)_{t \ge 0} ;\ n \ge 0)$ be a sequence of discrete time processes. For all $n$,
suppose that $(Z_t)_{t \ge 0} = (Z^{(n)}_t)_{t \ge 0}$, adapted to $(\mathcal{F}_t)_{t \ge 0}$, satisfies $n(Z_{t+1} - Z_t) \in \{-1, 0, 1\}$
with probability 1, and

$\mathbb{E}[Z_{t+1} - Z_t \mid \mathcal{F}_t] = \frac{1}{n}\left(\zeta(n) + aZ_t^2 + bZ_t^3 + O\left(\zeta(n)Z_t^2 + Z_t^4\right)\right)$ (6.4)

where $a > 0$, $b \in \mathbb{R}$ and $\zeta(n)$ is a sequence satisfying $\zeta(n) \to 0$ as $n \to \infty$. We allow both
$\zeta \in$ [CR] and $\zeta \in$ [NCR], but in the latter case, we assume in addition the existence of
$d > 0$ such that for all $n$

$\mathrm{Var}[n(Z_{t+1} - Z_t) \mid \mathcal{F}_t] > d$. (6.5)

Write $\mathbb{P}_{z_0}$ for the probability measure under which this process is defined and starts from $z_0$.

Lemma 6.2. Fix $\rho > 0$ sufficiently small. Then for $z_0 = -\rho$ there exist functions $L^*, U^* :
(-\infty, \infty) \to [0,1]$ satisfying $\lim_{\gamma\to-\infty} L^*(\gamma) = \lim_{\gamma\to\infty} U^*(\gamma) = 0$ such that for all $\gamma$,

$\limsup_{n\to\infty} \mathbb{P}_{z_0}\left(\tau^+_\rho > t^{\gamma,a}_\zeta(n)\right) \le U^*(\gamma)$, (6.6)
$\limsup_{n\to\infty} \mathbb{P}_{z_0}\left(\tau^+_\rho < t^{\gamma,a}_\zeta(n)\right) \le L^*(\gamma)$, (6.7)

where $\tau^+_\rho$ is a hitting time for $Z$. Moreover, if $\zeta \in$ [NCR] we can choose $U^*$ such that for
all $\gamma$ we have

$U^*(\gamma) < 1$. (6.8)

Remark 6.3. The upper (lower) bound in the lemma still holds if $(Z_t)_{t \ge 0}$ satisfies (6.4)
with $\ge$ ($\le$) in place of the equality sign or if in place of $z_0 = -\rho$ we have $z_0 \ge -\rho$
($z_0 \le -\rho$). Since $(Z_t)_{t \ge 0}$ has $0, \pm n^{-1}$ steps this can be shown by a simple coupling argument.

The next proposition shows that the drift of one coordinate stays close to its upper
bound $D_\beta(\cdot)$ for sufficiently long time. More precisely, for $\sigma_0 \in \Sigma_n$, $t > 0$, $\delta > 0$, $y \in [0,1]$
let

$K_n(\sigma_0, t, y, \delta) = \mathbb{P}_{\sigma_0}\left(\max_{0 \le \ell \le \min\{t, \tau_y^-\}} \left(D_\beta(S^1_\ell) - n\,\mathbb{E}_{\sigma_0}[S^1_{\ell+1} - S^1_\ell \mid \mathcal{F}_\ell]\right) > \delta\right)$, (6.9)

where $\tau_y^-$ is a hitting time for $S^1$. Then,

Proposition 6.4. Suppose that /3 ^ q/2 and set ctq = 1. Then for any y > ^: 

(1) If t{n) = o{Tn?) and 5{n)n'^t{n)^^ ^ oo then liioin^oo Kn{croTt{n),y, 5{n)) = 0. 

(2) If t{n) = 7n^/'^ then for all 6 > we have 

hm hmsup Kn{ao,t{n), y, = 0. 

7->'0 n-5-oo 

6.2. Proof of Theorem 2. 

6.2.1. Upper Bound on Mixing Time. Fix $\rho > 0$ small enough and let $\sigma_0 \in \Sigma_n$ be given.
By Proposition Part ( ), we can find $\delta > 0$ so that

$\sup\{D_\beta(s) : |\beta - \beta_s| < \delta,\ 1/q + \rho/2 < s < 1,\ |s - s^*| > \rho/2\} < -\delta$. (6.10)

where we use s* in place of s*(/3). Then by Lemma 2. 1 Part ( . ), we have that, 

where this and all hitting times below are of Sj. Define now Zt = s* — S} _ . Using 

(3.1), (3.3) Proposition 6.1 and applying Taylor's expansion for D^(s) around s* and then 
again for Dq(I3) around /3s, we infer that there exist a>0, 07^0, 6gM such that 

E[Zt+i - Zt\Tt\ ^ ^ (C(n) + aZl + hZl + O {C,{n)Zl + Z^)) , 

where ({n) = a^(n) + 0(^(n)^ + n~^) and also ( ) holds (if needed), since the probability 
of choosing any new color at time t + 1 is bounded above and below, uniformly in n and 
St- Hence by Lemma and Remark , for all 7 

P^o {r(s*-,) - > ^ + 0(1) 

Now, using the relation between C('^) ^ind ^(n), it is not difficult to verify that t^'°(n) ^ t^f (n) 
for all 7', where a' = aa and 7 = F{'j') for some F such that 7 — )• 00 if 7' — )• 00. 

From Lemma 2.1 Part (3), applied to the process {S} _ — {s* — p) : t ^ 0), it 

follows that with 1 — o(l) probability _ stays to the left of s* — p/2 for all t < n^. 

Then we may apply Lemma 2.1 Part (1) to the process {S] _ — {\/q + p/2) : t ^ 0) 

to conclude 

Finally another application of Lemma 2. 1 Part (3) gives S} _ ^1 /q+p for all t < 

with 1 — 0(1) probability. For the [CR] case, we use union bound (over all coordinates): 

IP^xo (5,4,.' i SP^^ ^ qU*{^) + 0(1). (6.11) 

For the [NCR] case, define r^^) = and r^*-') = inf{t ^ t^^-^^ : S^ ^ q-^ + p/2] 

for k > 1. Then, by inductive conditioning we obtain 

P.o (^^'^ ^ : k = l,...,q) ^(l-f/*(7))« + o(l). 
Since also S^_^^f^^.^ ^ 1/(7 + p for all G [l,q], t < with 1 — o(l) probability, we arrive to

P.o (5,5V („) ^ 5^+) ^ 1 - (1 - U*ij)y + 0(1). (6.12) 

We now re-employ the overall coupling in Sub-section but in view of (0.11) and 
(6.12) we change step (1) and instead of running the two chains for 7in time, we run them 
for i(i)(n) = tif{n). As (6.11), (6.12) show, we can choose 7' large enough such that 
^?!^{S ,a', i 5^+) ^ e forn sufficiently large. The remaining steps in the coupling are 

left unchanged and we choose the same parameter values, as in the proof of Theorem . 

Using the analysis of the modified step (1) given by (6.11) and (6.12), together with 
the analysis in Sub-section 1.7 of the remaining steps - which carries over (uniformly in /3 
near /3s{q)), since it only required (3 < (3c{q), we recover (4.14), namely 



The time is now given by 

t(5)(n)=t^i(n)+4f'(n), 

for some $\gamma > 0$. Since $\sigma_0$ is arbitrary and $\epsilon$ can be made arbitrarily small, by having
$\gamma, \gamma'$ large enough, this completes the proof for the upper bound in ( ) and ( ) with
$\alpha_2 = \pi/\sqrt{a\alpha}$.

6.2.2. No Cutoff in NCR Case. Using the modified overall coupling as introduced above, 
we obtain from (4.14), (6.12) and (6.8) for any 7' and sufficiently large 7 



for all (To, large enough n and some e > 0. Then, since in the [NCR] case 

t(^)(n)=t^Kn)+<f'(n)^4'4(n), 
this shows that there is no cut-off. 



5{n) 



6.2.3. Lower Bound on Mixing Time. Fix $\rho > 0$ small enough and start with $\sigma_0 = \mathbf{1}$, the
all '1' configuration. Define:

n-^i{n)-^I^A{n) if ^ G [CR] 
5in-2/3 if ^ e [NCR] 

where A{n) is a sequence tending to 00 sufficiently slowly and 5i > 0. Set 

N = inf {t : Df,{Sl) - uE [Sj^, - \ Tt] > S{n)} 

and define the process Yt which is equal to up to time N, but after this time evolves 
like a birth-and-death processes with itl/n increments and drift —n~^Dp{S}). Then 
{Zt = s* - Yt: 0) satisfies 

E[Zf+i - Zt\Tt] ^ I iCin) + aZ^ + bZf + O {C{n)Z^ + Zf)) , 

with a, 6, a as in the upper bound case, but with C,{n) = aS,{n) + 5{n) + 0{(,{n)'^ + n~^) 
and condition ( ) holds (if needed) as before. Then, using Lemma 6.2 and Remark 6.3, 

we have for n large enough Po-o ( "^p^ < ^7 ° ('^) ) ^ '^^* il) ■> where is a hitting time for Z 



and 7 G M. As before, it is not difficult to verify that if A{n) is increasing slowly enough, 
t^'"(n) ^ ty" (n), where 7 = ^(7') satisfies 7 — )• —00 if 7' — )• —00. 
Now define T = inf{t : St G S^^} and r' as t^, only with 5"^^ in place of Yt. Then

(r < t«f ' (n)) ^ (t+ < t«f ' (n)) + (iV < r' A t^f ' (n)) 

^ 2L*(7) + Kn{ao, 'H, - 6{n)) 

where Kn is defined in ( ). Then, if p is sufficiently small, we can use ( .._) for St starting 
from time T to obtain for all r > and 7": 



-0 I ""^^^W+t^f'Hll^ < 



T<tif'(n)] + 



^ 2L*(7) + K„((7o, tif'in), s* - p, 6in)) + ©((e'^^^" - r)-^) 



Using Proposition for the middle term, the last inequality gives ( with t^},{n) + 

ise 
□ 



ty" iji) in place of i°^(n). The remaining of the proof is identical to the subcritical case 



and this shows the lower bound for both parts of Theorem 2 with $\alpha_2 = \pi/\sqrt{a\alpha}$.
6.3. Proofs for Subsection 6.1. 

Proof of Proposition 6.1. First observe that for all /3, -D^(-) = and for all s > 



d 

dp 



d 

~dp 



-s + 



1 



l + {q-l)e 



Ma 



> 0. 



Now since Di3{s) is smooth as a function of s and /3 and (io(s) = —s + ^ it follows that 
Ps > 0. We have that 

d 



and so 



ds 



ds 

d^ 
ds^ 



DM 



-1 + 



2/3ge 9-1^ 1' 



2 ' 



l + {q-l)e 
{q-l){l + {q-l)e 1--^^ ^> 



-1 + 



2/3 



ds2 



^ 4f3\q-2) 



> 0. 



(6.13) 



(6.14) 



which implies that dg/2{s) > when s G (gi g + some small e. This implies that 

/3s < q/2. It follows that 



ds 



< 0. 



and so Dp{s) < when /? G [/3s, /Ss + e] and s G 7; + e) for some small e. It follows 



Mj' q 



by compactness then that for some - + e ^ s*(/3s) ^ 1 that D^^(s*(/3s)) = 0. By the 
definition of /3s and since Dp{s) is smooth we have that 



d_ 
ds 



Dp As) 



8=8* (/3s) 



ds^ 



DpM 



^ 0. 



(6.15) 
The equation $\frac{\partial}{\partial s} D_{\beta_s}(s) = 0$ is equivalent to



2Psq („ 1-1 

2/3ge 9-1^'' <i> 



l + {q-l)e 



g-l \ q I 



which is a quadratic equation in e i' and hence has at most 2 solutions which we 

denote si,S2 with si < S2- Since ^D^^(O) < then Di3^{si) < and so s*{(3s) = S2- 
In particular this implies that s*{Ps) is the unique s S (^i 1] such that D[j^{s) = 0. Also 
it follows that ^gDp^{s) > for s G (si,s*(/3s)) and that there exists s' E (si,s*(/3s)) 
such that -^Dp^{s') = 0. Since by equation (6.13) there is at most one s such that 



Dp^[s) = it follows that 



< 0. 



Hence by the Inverse Function Theorem s*{f3) is a smooth function of /3 when /3 is in a 
small neighborhood of /3s . Then we have that 



d 

d/3 



d 

~ dp 
d 

~ d(3 
>0, 



+ 



d 

d(3 



d 

ds 



s=s*{l3s) 



since ^-C'^s(s)|s=s.(^^) = which completes the proof of the second part. 

We now turn to prove the third part. As we have observed Df^^^s) is a smooth function 
satisfying 



d 



d^ 



0' and -^iy^^^s;|s=s.(/3,) 
Therefore, we deduce that for any p > 0, there exists 5i > such that 



< 0. 



Dp^{s) < -25i for all s G <j s : |s- s*(/3s)| ^ p/2, s G { ^ + P. 1 



(6.16) 



Since £iDp^{s 



< for all p > 0, there exists (52 > such that 



$|s^*(\beta) - s^*(\beta_s)| < \rho/2$ for all $|\beta - \beta_s| < \delta_2$.
Combined with (6.16), it follows that

$D_{\beta_s}(s) < -2\delta_1$, for all $|s - s^*(\beta)| \ge \rho$ and $|\beta - \beta_s| < \delta_2$. (6.17)

Since $D_\beta(s)$ can be viewed as a continuous function of $(\beta, s)$, by compactness
there exists $\delta_3 > 0$ such that $|D_\beta(s) - D_{\beta_s}(s)| \le \delta_1$ for all $s \in [1/q, 1]$ and $|\beta - \beta_s| \le \delta_3$.
Combined with (6.17), this completes the proof by taking $\delta = \delta_1 \wedge \delta_2 \wedge \delta_3$. □



Proof of Lemma 6.2. We do not lose anything by assuming that

$\mathbb{E}[Z_{t+1} - Z_t \mid \mathcal{F}_t] = f(Z_t)$ ; $f(z) = \frac{1}{n}\left(\zeta(n) + az^2 + bz^3 + cz^4\right)\mathbf{1}_{[-\rho_0, +\rho_0]}(z)$ (6.18)

for some $a > 0$, $b$, $c$ with $\rho_0 = 2\rho$ and that once $Z_t$ exits $[-\rho_0, +\rho_0]$ it is stopped. Indeed,
having $f$ vanish outside of $[-\rho_0, +\rho_0]$ does not change the asymptotics of the passing
time. Clearly, this is the case for $z > +\rho_0$. For $z < -\rho_0$, it follows from

$\mathbb{P}_{-\rho}\left(\tau^-_{-\rho_0} < t^{\gamma,a}_\zeta(n)\right) = o(1)$, (6.19)

for all $\gamma$, which is a consequence of Lemma 2.1 part (3) since the drift of $Z_t$ is at least $c/n$
on $[-\rho_0, -\rho]$ for some positive $c$ uniformly in $n$ (if $\rho$ is small enough).

As for replacing the error term in (6.4) by cZf, as the proof below shows, the functions 
U* , L* in the lemma restricted to condition (6.18) can be chosen to be continuous in a 
in a small interval [ag — e,ao + e] and the limits (6.6), (6.7) hold uniformly in a in this 
interval. This together with Remark 6. -3 implies the existence of U* , L* under which (6.6), 
(6.7) hold in the general case (6, I) with 4'"^^^'^^"''V)- Now, it is not difficult to see that 
the latter is bounded above and below by t^'^(^(n) for some C > and hence (6.6), (6.7) 

hold with t^'"'{n). Similar considerations apply for ( ). 
Set

$\Psi(z) = \int_0^z \frac{1}{f(x)}\,dx$ and $Y_t = \Psi(Z_t) - \Psi(Z_0) - t$.

The motivation behind the above definitions comes from a continuous time deterministic
analog of ( ) in the form of an ODE

$z'(t) = f(z(t))$ (6.20)

for which $z(t) = \Psi^{-1}(t - t_0)$ is a solution (roughly speaking $Y_t$ measures how far behind
or ahead "in schedule" $Z_t$ is, judging from its position).
Start with the [CR] case and set 

1/2 

In the deterministic setting the time it takes for z{t) to pass from z{0) = —p to p is 

_p C{n) + ax^ + bx^ + cx^ ^ ^ ^ ^ 

if p is small enough. This will be shown in Proposition 6.5 below. Thus, bounding the 
passage time r+ around t'^'°-{n) can be achieved by bounding |Vjc,ti(„)|. 
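
The order of the deterministic passage time can be checked numerically. The sketch below makes a simplifying assumption that is ours, not the paper's: the cubic and quartic terms of $f$ are dropped, so that $f(z) = (\zeta + az^2)/n$. Under that assumption the time to pass from $-\rho$ to $\rho$ is $n\int_{-\rho}^{\rho} \frac{dx}{\zeta + ax^2} = \frac{2n}{\sqrt{a\zeta}}\arctan\big(\rho\sqrt{a/\zeta}\big)$, which approaches $\pi n/\sqrt{a\zeta}$ as $\zeta \to 0$. All function names are illustrative.

```python
import math

def passage_time_closed_form(n, zeta, a, rho):
    """n * integral_{-rho}^{rho} dx / (zeta + a x^2), evaluated in closed form."""
    return 2.0 * n / math.sqrt(a * zeta) * math.atan(rho * math.sqrt(a / zeta))

def passage_time_numeric(n, zeta, a, rho, steps=200_000):
    """Midpoint-rule approximation of the same integral."""
    h = 2.0 * rho / steps
    total = 0.0
    for k in range(steps):
        x = -rho + (k + 0.5) * h
        total += 1.0 / (zeta + a * x * x)
    return n * h * total

def leading_order(n, zeta, a):
    """Limiting value pi * n / sqrt(a * zeta) as zeta tends to 0."""
    return math.pi * n / math.sqrt(a * zeta)
```

The ratio of `passage_time_closed_form` to `leading_order` equals $\frac{2}{\pi}\arctan(\rho\sqrt{a/\zeta})$, which makes explicit why the passage time depends on $\rho$ only at lower order when $\zeta$ is small.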

Accordingly, let $Y_t = M_t + A_t$ be the Doob decomposition of $Y_t$, with $M_t$ a zero-mean
martingale and $A_t$ the predictable process. The next proposition will allow us to bound
$Y_t$. The proof of this proposition will be deferred to the end of this section.

Proposition 6.5. If p is small enough and C € [CR] then 



(1) ^{p) - ^{-p) = f^'^in) + 0{n). 

(3) A_^c,af^^^ = o{w^{n)) with probability 1. 
Now using the monotonicity of $\Psi$ in $[-\rho_0, +\rho_0]$ we have

^ P_, (y^c,.(„) < t<'-in) - t^'^'H + 0{nf 
^ P_p (M^c,a(„) < -jw'^in) + 0(n) + o (w^n)'^ ) 
0(u'^(n)) 



,(7 + 0(l))u;?(n), 

where the last inequality is a second moment bound. This shows (6.6). For the lower 
bound, if —7 is large enough, we may write 

P_p (r+ < 4''^(n)) = P_p (3t < 4''^(n) : ^ p) 

^ P_p (3t < 4''^(n) : ^t^''^(n)-4'"(n)+0(n)) 

% (^3t < t^''^(n) : Mt-^ --fw^{n)+0{n) + o(w^{n)^^ 



0(u;f(n)) 



--/ + 0{l))w<{n) 

where the last inequality follows from Doob's inequality. This shows (6,7) 

Next, we address the [NCR] case. We can no longer use the means analysis (6.20)
throughout the entire passage interval $[-\rho, \rho]$ as $Z_t$ is not concentrated around its mean
near 0. Accordingly, we analyze the passage time in each of the following segments separately:

$[-\rho, -rn^{-1/3}]$ ; $[-rn^{-1/3}, +rn^{-1/3}]$ ; $[+rn^{-1/3}, +\rho]$,

for some $r > 0$ to be chosen later.

We start with the upper bound. For the sequel, let $w = rn^{-1/3}$. The upper bound will
follow if we show the following:

(1) Segment $[-\rho, -w]$. For any $\gamma$,

$\lim_{r\to\infty}\limsup_{n\to\infty} \mathbb{P}_{-\rho}\left(\tau^+_{-w} > t^{\gamma,a}_\zeta(n)\right) = 0$. (6.21)

(2) Segment $[+w, +\rho]$. For any $\gamma$,

$\lim_{r\to\infty}\limsup_{n\to\infty} \mathbb{P}_{w}\left(\tau^+_{\rho} > t^{\gamma,a}_\zeta(n)\right) = 0$. (6.22)

(3) Segment $[-w, +w]$. For any $r > 0$, there exists $u : \mathbb{R} \to [0, 1)$ such that

$\limsup_{n\to\infty} \mathbb{P}_{-w}\left(\tau^+_{w} > t^{\gamma,a}_\zeta(n)\right) \le u(\gamma) < 1$, (6.23)

for all $\gamma$. Furthermore, $u(\gamma) \to 0$ as $\gamma \to \infty$.

All $\tau$ are hitting times for $Z$. Indeed, by first choosing large enough $r$ and then choosing
large enough $\gamma$ both (6.6) and (6.8) will follow by multiplication. We proceed to prove
each of the above statements.

Segments [— p, —w], [+w, +p]. Here we can use the means analysis as in the [CR] case. As 
before, we do not change the asymptotics of the passage time through these intervals, if 
we assume that f{z) in ('< 1 satisfies: 

/(^) = UCin) + az"^ + bz^ + cz'^)t[-p„-w„]u[+wo,+Pi,]iz) 
where $a, b, c, \rho_0$ are as before, $w_0 = w/2$ and once $Z_t$ exits $[-\rho_0, -w_0] \cup [+w_0, +\rho_0]$ it is
stopped. Indeed this follows from the same reasoning and in addition since for any $\gamma$

lim P^(t^,^ <4'"(n)) =0, 

uniformly in n (large enough) as it follows from Lemma 2.1 part (2) since the drift of Zt 
is non- negative on [wo,w]. 

We use the same definitions for Yt, Mt and At as above. In place of Proposition ri.5 we 
have 

Proposition 6.6. Assume C G [NCR]. There exists k{r) satisfying k{r) — )• as r —)• 00 

such that for any p small enough, r large enough, n large enough and all t: 

(1) ^{-w) - ^{-p), ^{p) - '^{w) ^ /c(r)n^/3_ 

(2) IEM2 ^ k{r)tn'^l-\ 

(3) \At\ ^ k{r)t with probability 1. 

The proof is again deferred. Now, as before 

2 (6.24) 

[[L — K[r))e ' — K(r) ) 

where the last inequality is Chebyshev. This goes to zero as r — )• 00 for any 7. This shows 
(6.21). Similarly, 











IP-pl 










< 




k{r)e 



k{r)e 



7 



((1 - k{r))et - k{r)f 



and this shows (6.22). 

Segment [—w,vo\. Here we still assume (6.18), but instead of absorbing Zt at the bound- 
aries, we shall now suppose that Zt evolves like a symmetric random walk with ibn~^ 
steps, once it exits [— pO)+Po]- 

We first show that u can be chosen to vanish at infinity. Consider the process Ut = 
{Uq - {Zt - Zq + Su-^/H), for (5 > with Uq to be chosen later and set N = inf{t : ^ 0}. 
Then, by the definition of the [NCR] regime for n large enough Ut/\N is a non-negative 
supermartingale satisfying the requirements of Lemma 2.3 and hence 

P_^(r+ > ti^^%n)) = ^-^{Zt <w yti^ 4''^(n)) 

^ F^^{Ut>Uo + Zo-{w + 6e''n-^/^) Vt ^ ^'"(n)) 

= P_^(iV > 4'°(n)) 

^/dn-ie7/2n2/3 ^ ^ ^ 

where we choose Uq = w + 6e'^n~^^^ — Zq and 6 = e~^ . The last expression can be made 
arbitrarily small by taking 7 large enough, uniformly in n if it is sufficiently large. 
To show that $u$ can satisfy $u(\gamma) < 1$ for all $\gamma$, we have to show that $Z$ can cross
from —w to w in t^''^(n)-time for arbitrarily small 7. If p is small and n is large, then 
Xt = n{Zt — {—w)) satisfies the conditions in Lemma 2.2 with 6 = C(^)~ a = d. 
Therefore 



rn 



^ Ci exp{-C2(2re-^/2 + e^/^C{n)-n'^^^f} + 0(n-2/3) 

which is positive for all 7, once n is large enough. This proves (6.23) and concludes the 
proof of the upper bound. 

To show the lower bound in the [NCR] case, set Vt = Zt — 5n~^/^t + w and choose 
(5, r > such that V,. + has non-positive drift whenever V,. + ^ 0. Then, 



Zt ^ w) 
V,.+ ^2w- (5e^n-^/^) 



and part ( -) of Lemma ; shows that the last expression goes to as 7 — )• —00 uniformly 
in n (large enough). This proves ((i.7) and completes the [NCR] case. □ 

It remains to prove Propositions 6.4-6.6. 

Proof of Proposition 6.4- Let r* = min |r~, t(n), miuj ^ 2 min{t ^ : 5^ ^ ^}|. Fix some 
2 ^ i < j ^ q and set 

Yt = Sl-S{. 

Let Ut-i = Yt — Yt-i — Eo-Q [5^ — Yt-i \ J-t-i] and then since \Yt — Yt-i\ ^ ^ we have that 
\Ui\i^^. Define the process Zt by = and 

Zt - Zt-i := sign(Zj„i)sign(yt_i)[/t_i 

where 

'l X ^ 0, 



sign(x) 



-1 x<0. 



With this definition Zt is clearly a martingale and since \ Zt — Zt-i\ ^ ^, then Eo-q^^^ ^ ^ 
and so by Doob's maximal inequality, 

^ ^ ^2 2>2tin) 



max I I 

^ t s£ t{n) 



(6.25) 



Now when t < r* we have that SI, Si < - and so 



^2/3(5^-1) _g2/3(5^-i) 



^ 2/3 



si-s{ 



By Jensen's inequality X]fc=i e^^^^^ 1^ q so 



E^, [sign(yt_i)(yt - Yt-i) I Tt-i] 



sign(5,*_i - SU 



e2/3{5,^-i) 



k=l 



{sU-sU 



q-2{3 
qn 



^ 0. 



(6.26) 
Now when |lt-i| ^ - we have that — = sign(yj_i)(lt — Yj-i) and we always

have 

\Zt\ - \Zt-i\ ^ sign{Zt-i){Zt - Zt-i) = svga{Yt-i)Ut-i 
with equahty when sign(Zt) = sign(Zj_i). Hence it follows that when |it-i| ^ - and 

\Yt\ - \Yt-i\ = sign{Yt-i){Yt - Yt-i) 
^ s\g-a{Yt-i)Ut-i 
^ \Zt\ - \Zt-i\ 

where the first inequality follows from equation (6.26). It follows by induction that 
\Zt\ ^ 1^1 — - for all t ^ r*. In particular we have by equation (6.25) that 



CO 



max max 

2 s£ j<i ^ <? ^ t ^ r* 



si-si 



o 



t{n) 



o(l). 



(6.27) 



By Markov's inequality with probability tending to 1 we have that 



Si, - Si, 



o(l) for 



every pair 2 ^ i, j ^ q. Now by construction S^, ^ y — - so with high probability we have 



that S** ^ g ~ "^nf'+'^l-'-) < g which implies that with high probability r* = min{t(n), }. 
Now, by Taylor series expansions, 



^ 



\i=2 



^ q — 1 ^ \ (7 — 1 



i=2 

^ o 



.1-5^2 



max 

2 s£ i<j ^ q 



SI - SI 



i=2 
2N 



where the first inequality is by Jensen, and we have used the fact that Yli=2 SI = ^ — Sj. 
It therefore follows that with high probability for all ^ t ^ max{t(n),r~} that 

1 1 e^/^^'-i 1 
E^o [S'i - S^_i I I't-l] = — S^_^ 



Sli - o 



max 

2 ^ i<j ^ q 



and hence that 



n J^^J . Dp{Sl_i) - nE^„ [Sl - S}_i I J"t_i] 



max max 

2 ^ i<j ^ qO ^ t ^ T* 



SI - Sl 



which combined with equation (6.27) and Markov's inequality completes the result. □ 
Proof of Proposition 6.5. Starting with part



^ ndx ^ I'P / n n . ^ 



2n 1 / pJE \ fP ( bx^ + cx^ ^ f (bx^ + cx^) 



2 



\/<(n) \VC^) J-p\ (C(n) + ax2)2 \^(C(n) + ax2)3 

+ 0(n) + \n[\ 



vr n ^, ^ ( ( cx'^ {bx^ + cx^)^\ , 



2 nrj4 /-( 3 /yG 



P 



t^'^in) + 0{n) + 0{n dx] = t^'^in) + 0{n). 



To prove part (i), we use the law of total variance: 

Var Mt = VarE[Aft|Ji_i] + E Var[Mt| 
= VarMt_i + ¥.VaT[Yt\Ft-i] 

< VarMt_i + max W {z)? ¥.Ysj[Zt - Zt-i\Tt-i] 



\z\ ^ 2po 

^ VarMt_i + 



n2 1 



(^2(n) n2 

Hence by induction 



As for part (3), 
At+i-At = E[Yt+i-Yt\Tt] 



E[^{Zt+i)-^{Zt)\Tt]-l 



E 



^'{Zt){Zt+i - Zt) + 0{ max \^"{z)\ {Zt+i - Ztf) 

\z\ ^ 2p ' 



■^'{Zt)E[Zt+i - Zt\Ft] + 0{ max W'{z)\n-^) - 1 

kl 2po 

Ofinax |^"(z)|n-2) = of— V 



where the last inequality follows from 



•2 



n 



\2az + Sbz"^ + Acz^l 



(C(n) + + bz^ + cz4)2 



(C(n) + C2z2)2 VC^/'H 
if /3 is small enough. Then again by induction, we conclude that 
Proof of Proposition 6.6. If $r$ is large enough and $\rho$ is small, we have $f(z) \ge n^{-1}(a/2)z^2$
for all $w_0 < |z| < \rho_0$, where as before $w_0 = w/2$ and $\rho_0 = 2\rho$. This immediately gives part
(1) with $k(r) = Cr^{-1}$.

The proofs of parts (2), (3) are similar to the ones in Proposition 6.5. This time the
bounds on the derivatives become 

max ^ C7r-^n^°/3 ; max \^" {z)\ Cr'^n^ . 

wo/2<\z\<2po wo/2<\z\<2po 

Proceeding by induction as before, we obtain ( ), ( ) with k{r) = Cr^^ and k{r) = Cr^"^ 
respectively. □ 

7. Essential Mixing 

Proof of Theorem As the reader can verify, most statements in Sections ■ and \ hold 
when (3 < f^dQ) and even f3 < q/2 (the restrictions on /3 are indicated before each statement 
there). The only time f3 < f3s{q) < (3c{q) is required is in step (I ) of the overall coupling, 
where the condition ensures that the drift of each single coordinate SI is negative in all 
(1/g, 1], which, in turn, implies that for any initial configuration, after t = 0{n) time, 
at G Tin, which is a necessary starting point for the couplings that follow. 

Now, if /3 ^ Psig), but still [3 < (3c{q), we may replace this step, with the requirement 
that ctq is initially chosen from S„ = S^. The analysis of the overall coupling will remain 
the same, with the coalescence time being even smaller (but just by a linear term, which 

can be absorbed in the cutoff- window term). Thus, the restricted mixing time ^^ix(e)("') 
will be upper bounded as before. In addition, the lower bound in Subsection 4.1 will also 
hold for i^"x(£)('^)) since as initial configuration, we may take any (Tq € dp^Tnip) for p > 0. 

It remains to show that S„ \ S„ has an exponentially decreasing probability under pn- 
This follows immediately from the large deviations analysis in Subsection 2.2. If /? < l3c{q), 
the rate function Ip^q is strictly positive away from e and in particular there exist Ci > 0, 
C2 > 0, such that 

Pn (Sn \ S„) = 7r„ {Sn \ ) < de'^^". 
This concludes the proof of the theorem. □ 